ANALYSIS OF COVARIANCE: ITS NATURE AND USES*


William G. Cochran
The Johns Hopkins University, Baltimore, Maryland, U.S.A. 


1. INTRODUCTION 

This paper is intended as an introduction to the subsequent papers
in this issue. It discusses the nature and principal uses of the analysis
of covariance, and presents the standard methods and tests of
significance.

As Fisher [1934] has expressed it, the analysis of covariance "combines
the advantages and reconciles the requirements of the two very
widely applicable procedures known as regression and analysis of
variance." This dual role can be illustrated by a two-way classification
in which rows represent treatments, and columns represent blocks or 
replications. The typical mathematical model appropriate to the 
analysis of covariance is 


$$y_{ij} = \mu + \tau_i + \rho_j + \beta(x_{ij} - x_{..}) + e_{ij} \qquad (1)$$


Here $y_{ij}$ is the yield or response, while $x_{ij}$ is an auxiliary variate,
sometimes called the concomitant variate or covariate, on which $y_{ij}$ has a
linear regression with regression coefficient $\beta$. The constants $\mu$,
$\tau_i$ and $\rho_j$ are the true mean response and the effects of the $i$th
treatment and $j$th replication, respectively. The residuals $e_{ij}$ are
random variates, assumed in standard theory to be normally and independently
distributed with mean zero and common variance.**

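From the viewpoint of analysis of variance, equation (1) may be
rewritten as

$$y_{ij} - \beta(x_{ij} - x_{..}) = \mu + \tau_i + \rho_j + e_{ij} \qquad (2)$$

In this form, (2) is the typical equation for an analysis of variance
of the quantities

$$y_{ij} - \beta(x_{ij} - x_{..})$$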


*Paper No. 319, Department of Biostatistics. 
**The symbols $x_{..}$, $y_{..}$ denote overall means, while $x_{i.}$, $y_{i.}$ denote treatment means.
These are the deviations of $y_{ij}$ from its linear regression on $x_{ij}$, or
the values of $y_{ij}$ after adjustment for this linear regression. In this
setting, $\tau_i$ may be regarded as the true effect of the $i$th treatment on
$y_{ij}$, after adjustment for the linear regression on the covariate $x_{ij}$.
Thus the technique enables us to remove that part of an observed
treatment effect which can be attributed to a linear association with
the $x_{ij}$.

When the objective of the analysis is to fit a regression of y on x, the
parameters $\tau_i$ and $\rho_j$ in equation (1) represent "nuisance" parameters,
included in the mathematical specification in order to make it realistic.
In this way the analysis of covariance extends the study of regression
relationships to data of complex structure in which the nature of the
regression is at first sight obscured by structural effects like the $\tau_i$
and $\rho_j$.


2. PRINCIPAL USES 
These may be grouped under several headings. 


2.1 To increase precision in randomized experiments. This is probably
the most frequent application. The covariate x is a measurement,
taken on each experimental unit before the treatments are applied,
which is thought to predict to some degree the final response y on
that unit. The first illustration of the covariance method in the
literature was of this type (Fisher [1932]). The variate x was the yield of
tea per plot in a period preceding the start of the experiment, while
y was the tea yield at the end of a period of application of treatments
(in this illustration, the treatments were "dummy"). Adjustment of
the responses y for their regression on x removes the effects of variations
in initial yields from the experimental errors, insofar as these effects
are measured by the linear regression. In this example these effects
might be due either to inherent differences in the tea bushes or to soil
fertility differences that were permanent enough to persist during the
course of the experiment.

With a linear regression equation, the gain in precision from the
covariance adjustment depends primarily on the size of the correlation
coefficient $\rho$ between y and x on experimental units (plots) that receive
the same treatment. If $\sigma_y^2$ is the experimental error variance when no
covariance is employed, the adjustments reduce this variance to a value
which is effectively about

$$\sigma_y^2 (1 - \rho^2) \left\{ 1 + \frac{1}{f_e - 2} \right\}$$

where $f_e$ is the number of error d.f. The factor involving $f_e$ is needed
to take account of errors in the estimated regression coefficient. If
$\rho$ is less than 0.3 in absolute value, the reduction in variance is
inconsequential, but as $\rho$ mounts towards unity, sizeable increases in
precision are obtained. In Fisher's example $\rho$ was 0.928, reflecting a high
degree of stability in relative yield of a plot from one period to another.
The adjustment reduced the error variance roughly to a fraction
$\{1 - (0.928)^2\}$, or about one-sixth, of its original value. Some of the most
spectacular gains in precision from covariance have occurred in situations
like this, in which the covariate represents an initial calibration of the
responsiveness of the experimental units.
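The approximate variance formula above is easy to tabulate. The following is a minimal sketch, not part of the original paper: the function name `variance_after_covariance` and the choice of 20 error d.f. are illustrative assumptions.

```python
def variance_after_covariance(sigma2_y, rho, f_e):
    """Approximate error variance after a linear covariance adjustment.

    sigma2_y : error variance with no covariance adjustment
    rho      : correlation between y and x within treatments
    f_e      : error degrees of freedom (must exceed 2)
    """
    if f_e <= 2:
        raise ValueError("f_e must exceed 2 for the factor 1 + 1/(f_e - 2)")
    return sigma2_y * (1.0 - rho**2) * (1.0 + 1.0 / (f_e - 2))


# Fraction of the original variance that remains, for an assumed f_e = 20.
for rho in (0.3, 0.6, 0.928):
    frac = variance_after_covariance(1.0, rho, f_e=20)
    print(f"rho = {rho:5.3f}: remaining fraction of variance = {frac:.3f}")
```

With few error d.f. the factor involving $f_e$ can outweigh a small correlation, which is one way to see why the reduction is inconsequential for small $\rho$.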

In this use the function of covariance is the same as that of local
control (pairing and blocking). It removes the effects of an environmental
source of variation that would otherwise inflate the experimental
error. When the relation between y and x is linear, covariance and
blocking are about equally effective.* If instead of using covariance
we can group the units into replications such that the x values are
equal within a replication, this blocking also reduces the error variance
to $\sigma_y^2 (1 - \rho^2)$.

The potentialities of preliminary measurements as a means of
increasing precision have frequently been recognized by experimenters.
In animal feeding experiments the response is taken as gain in weight
(final minus initial weight) rather than final weight itself. Insulin may be
assayed from the drop in blood sugar (initial reading minus reading 3 hours
after injection of insulin) instead of from the 3 hour reading. The
weight of a treated muscle on the right side of the body may be taken
as a percentage of the weight of the corresponding untreated muscle
on the left side. Such adjustments make the best use of the covariate
only when the relation between y and x is exactly that implied by the
adjustment. In the animal feeding and insulin examples, the assumption
is that $\beta = 1$; in the muscles, that y/x is independent of x and has
constant variance. If these assumptions do not hold, the adjustment
falls short of the optimum and sometimes is worse than no adjustment
at all. By a covariance analysis, the experimenter can utilize his
knowledge or speculations about the general nature of the relation
between y and x, but still leave flexibility in the process by including
parameters like $\beta$ that are estimated from the data. Incidentally, he
can verify from the covariance analysis whether a specific simple
adjustment like the use of (y - x) is good enough, as it sometimes is.

In a covariance analysis, the preliminary variate x may be measured
on a completely different scale from that of the response y, a situation
in which the experimenter would have difficulty in creating a "home-made"
method of adjustment. Bartlett [1937] used a visual estimate
of the degree of saltiness of the soil to adjust cotton yields. Federer
and Schlottfeldt [1954] used the serial order (1, 2, ..., 7) of the plot
within a replication as a basis for a quadratic regression adjustment of
tobacco data, thereby removing the effects of an unexpected gradient in
fertility within the replications. Similarly, the reading performances of
children under different methods of instruction may be adjusted for
variations in their initial I.Q.'s. Note also that x need not be a direct
causal agent of y; it may, for instance, merely reflect some characteristic
of the environment that also influences y.

*See p. 281.

When covariance is used in this way, it is important to verify that the
treatments have had no effect on x. This is obviously true when the
x's were measured before the treatments were applied. But sometimes
the x variates are measured after treatments have been applied, as
when plant number shortly before harvest is used to adjust crop yields
for uneven growth, or as happened in the index of saltiness used by
Bartlett. When the treatments do affect the x-values to some extent,
the covariance adjustments take on a different meaning. They no
longer merely remove a component of experimental error. In addition,
they distort the nature of the treatment effect that is being measured.
If the higher yields given by superior treatments are due mostly to
their effects in increasing numbers of plants, a covariance adjustment,
which attempts to measure what yields would be if plant numbers
were equal for all treatments, may remove most of the real treatment
effect. The F-test of treatments against error for the x-variate is
helpful when there is doubt whether treatments have had some effect
on x.


2.2 To remove the effects of disturbing variables in observational studies.
In fields of research in which randomized experiments are not feasible,
we may observe two or more groups differing in some characteristic,
in the hope of discovering whether there is an association between this
characteristic and a response y. Examples are differences in heights
of urban and rural school children, differences in illness rates between
tenants of public and slum housing, and differences in expenditures
for luxuries between clerical and manual workers. In observational
studies it is widely realized that an observed association, even if
statistically significant, may be due wholly or partly to other disturbing
variables $x_1, x_2, \ldots$ in which the groups differ. Where feasible, a
common device, analogous to blocking in randomized experiments, is
to match the groups for the disturbing variables thought to be most
important. In the same way, a covariance adjustment may be tried
for x-variables that have not been matched.
In a comparison of the heights of children from two different types of
school, Greenberg [1953] found that the two groups differed slightly, 
though not significantly, in mean age. A covariance adjustment for 
age resulted in a more sensitive comparison of the heights. As a more 
complex example, Day and Fisher [1937] adjusted the log S. D. of 
leaf length (used as a measure of within-species variability) for fluctua- 
tions in length, breadth and thickness of leaves in comparing populations 
of Plantago maritima from different regions. 

In observational studies covariance can perform two distinct functions.
One is to remove bias. To illustrate, it follows from model (1)
that the unadjusted difference $(y_{i.} - y_{j.})$ between the means of two
groups is

$$y_{i.} - y_{j.} = \tau_i - \tau_j + \beta(x_{i.} - x_{j.}) + e_{i.} - e_{j.}$$

If the two groups have not been matched for x, the difference $(x_{i.} - x_{j.})$
may reflect a real difference in their x-distributions, being much larger
than can be accounted for by within-group variations. The term
$\beta(x_{i.} - x_{j.})$ is then of the nature of a bias which if allowed to remain
will render tests of significance and confidence limits invalid. If model
(1) applies to the data at hand, a covariance adjustment removes this
bias. Most users of covariance in observational studies would, I
think, regard coping with bias as its primary function. However, even
if there are no real differences between the x-distributions in the two
groups, so that there is no danger of bias, covariance may still be used
to increase the precision of the comparison as in the applications in
section 2.1.

Unfortunately, observational studies are subject to difficulties of 
interpretation from which randomized experiments are free. Although 
matching and covariance have been skillfully applied, we can never be 
sure that bias may not be present from some disturbing variable that 
was overlooked. In randomized experiments, the effects of this variable 
are distributed among the groups by the randomization in a way that is 
taken into account in the standard tests of significance. There is no 
such safeguard in the absence of randomization. 

Secondly, when the x-variables show real differences among groups
(the case in which adjustment is needed most), covariance adjustments
involve a greater or less degree of extrapolation. To illustrate by an
extreme case, suppose that we were adjusting for differences in parents'
income in a comparison of private and public school children, and that
the private-school incomes ranged from $10,000-$12,000, while the
public-school incomes ranged from $4,000-$6,000. The covariance
would adjust results so that they allegedly applied to a mean income
of $8,000 in each group, although neither group has any observations
in which incomes are at or even near this level.

Two consequences of this extrapolation should be noted. Unless 
the regression equation holds in the region in which observations are 
lacking, covariance will not remove all the bias, and in practice may 
remove only a small part of it. Secondly, even if the regression is 
valid in the no man’s land, the standard errors of the adjusted means 
become large, because the standard error formula in a covariance 
analysis takes account of the fact that extrapolation is being employed 
(although it does not allow for errors in the form of the regression 
equation). Consequently the adjusted differences may become in- 
significant statistically merely because the adjusted comparisons are 
of low precision. 

When the groups differ widely in x, these difficulties imply that the
interpretation of an adjusted analysis is speculative rather than soundly 
based. While there is no sure way out of this difficulty, two precautions 
are worth observing. 

(i) Consider what internal or external evidence exists to indicate 
whether the regression is valid in the region of extrapolation. Sometimes 
the fitting of a more complex regression formula serves as a partial 
check. 

(ii) Examine the standard errors of the adjusted group means, 
particularly when differences become non-significant after adjustment. 
Confidence limits for the difference in adjusted means will reveal how 
precise or imprecise the adjusted comparison is. 


2.3 To throw light on the nature of treatment effects. This application is
closely related to the previous one. In a randomized experiment, the 
effects of several soil fumigants on eelworms, which attack some English 
farm crops, were compared. After the treatments had been given time 
to exert their effects, the numbers of eelworm cysts per plot and the 
yields of the crop, spring oats, were both recorded. Significant effects 
were produced on both eelworms and oats. It would be of interest to 
discover whether the reductions in numbers of eelworms were the 
causal mechanism in producing the observed differences in oats yields. 
If the treatment effects on the oats (y) disappear after adjusting by 
covariance for differences in the numbers of eelworms (x), this suggests,
at least at first sight, that the treatment differences are simply a reflec- 
tion of the differences produced by the fumigants on eelworm numbers. 
There are numerous instances of this kind in which a concomitant 
variable might be in part the agent through which the treatments 
produce their effects on the principal response. A covariance analysis 
offers the possibility of exploring whether this is so. 
Here again, however, there are difficulties which restrict the utility of
this ingenious tool. These are analyzed by Fairfield Smith in a later 
paper in this issue. Since in most of these applications the treatments 
will have produced significant effects on x, there is the problem of 
extrapolation discussed in the previous section. In addition, as Fairfield 
Smith illustrates, the interpretation of the adjusted y averages requires 
careful study. Sometimes these averages have no physical or biological 
meaning of interest to the investigator, and sometimes they do not 
have the meaning that is ascribed to them at first glance. 


2.4 To fit regressions in multiple classifications. The simplest situation, 
discussed in elementary text books, involves a single classification. 
By standard techniques we can (i) fit a separate regression of y on x 
within each class, (ii) test whether the slopes or positions of the lines 
differ from class to class, and (iii) if advisable, make a combined estimate 
of a common slope. As an example of a regression in a row × column
classification, the regression of wheat yield on shoot height, number of
plants and number of ears was worked out from a series of growth
studies on wheat conducted in Britain (Cochran [1938]). Each quartet
of observations $(y, x_1, x_2, x_3)$ represented the mean over several plots
at one station in one year. In order that soil, geographic and seasonal 
factors would be adequately sampled, data were obtained from 7 
stations for each of 5 years, making 35 quartets. Consequently it was 
necessary to fit constants for the mean yield of each station and the 
mean yield of each year. 


2.5 To analyze data when some observations are missing. An interesting
by-product of the covariance method, first pointed out by Bartlett
[1937], is that it may be used to compute the exact analysis of variance
when some observations are missing. To each missing observation we
assign any convenient value (e.g. 0, 5, or 100) and introduce a dummy
x-variate that takes the value 1 for the missing unit, and 0 for all other
units. The standard covariance computations then give the correct
least squares estimates of the treatment means and the exact F- and
t-tests. This method is probably slower than the insertion of a missing
value by the Yates formula [1933], but it is useful (i) with unfamiliar
classifications where the Yates formula has not been worked out and
(ii) where exact F- and t-tests are important, since the Yates method
gives only approximate tests.
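The dummy-variate device can be checked numerically. The sketch below is illustrative only: the 3 × 4 data, the function names and the position of the missing cell are hypothetical. It also makes one explicit choice that is an assumption of this example: the treatment means are adjusted to x = 0 (the value of the dummy on every real observation) rather than to the overall mean of x, which would shift every mean by the same constant and leave treatment differences unchanged.

```python
import numpy as np


def error_component(a):
    """Residual a_ij - a_i. - a_.j + a_.. of a treatments x blocks array."""
    return (a - a.mean(axis=1, keepdims=True)
              - a.mean(axis=0, keepdims=True) + a.mean())


def means_via_dummy_covariate(y, miss, placeholder=0.0):
    """Treatment means from a covariance analysis on a dummy x-variate."""
    y = y.copy()
    y[miss] = placeholder                 # any convenient value
    x = np.zeros_like(y)
    x[miss] = 1.0                         # 1 for the missing unit, 0 elsewhere
    xe, ye = error_component(x), error_component(y)
    b = (xe * ye).sum() / (xe * xe).sum()  # b = E_xy / E_xx
    return y.mean(axis=1) - b * x.mean(axis=1)   # adjusted to x = 0


def means_omitting_cell(y, miss):
    """Least squares treatment means fitted to the observed cells only."""
    t, r = y.shape
    rows, obs = [], []
    for i in range(t):
        for j in range(r):
            if (i, j) == miss:
                continue
            row = np.zeros(1 + t + r)
            row[0] = row[1 + i] = row[1 + t + j] = 1.0
            rows.append(row)
            obs.append(y[i, j])
    coef = np.linalg.lstsq(np.array(rows), np.array(obs), rcond=None)[0]
    m, treat, block = coef[0], coef[1:1 + t], coef[1 + t:]
    return m + treat + block.mean()       # estimable: mean over blocks


# Hypothetical yields; the (0, 0) cell is treated as missing.
y = np.array([[12.1, 10.4, 11.8, 13.0],
              [14.2, 13.1, 15.0, 14.7],
              [9.8, 9.1, 10.9, 10.2]])
miss = (0, 0)
via_dummy = means_via_dummy_covariate(y, miss, placeholder=0.0)
direct = means_omitting_cell(y, miss)
print(via_dummy)
print(direct)
```

The placeholder value does not matter, because the regression coefficient on the dummy absorbs whatever value was inserted.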

Covariance can also be used (Nair [1939]) to estimate individual 
yields of a group of plots whose produce has inadvertently been com- 
bined, so that only a total over the group is known. 
3. THE STANDARD COMPUTATIONS


The computations, which will be reviewed briefly, are essentially 
the same for all mathematical models in which a single regression and 
a single residual variance are postulated. These cases include the simpler 
multiple classifications and experimental designs (randomized blocks, 
cross-over designs, latin squares) as well as balanced and partially
balanced incomplete block designs without recovery of inter-block 
information. Separate discussion is required for hierarchical classifica- 
tions involving more than one regression equation or more than one 
residual variance in the specification, as with split-plot designs or 
incomplete blocks when inter-block information is recovered. 

The backbone of the standard procedure is an analysis of sums of 
squares and products into the Treatments and Error components. 
Table 1 shows the notation employed.


TABLE 1
SUMS OF SQUARES AND PRODUCTS

              D.f.             $(x^2)$     $(xy)$      $(y^2)$
Treatments    $t - 1$          $T_{xx}$    $T_{xy}$    $T_{yy}$
Error         $f_e$            $E_{xx}$    $E_{xy}$    $E_{yy}$
Sum           $t - 1 + f_e$    $S_{xx}$    $S_{xy}$    $S_{yy}$


A line is added giving the sums for treatments and error. Thus
$S_{xx} = T_{xx} + E_{xx}$, etc.

The error s.s. for y is now divided into two parts: the s.s. for regres- 
sion on x (1 d.f.) and the s.s. of deviations (Table 2). The same sub- 
division is made for the Sums. 


TABLE 2
PARTITION OF $E_{yy}$ AND $S_{yy}$ INTO COMPONENTS FOR REGRESSION
AND FOR DEVIATIONS

                          Regression                  Deviations
              $(y^2)$     D.f.   S.s.                 D.f.             S.s.                                           M.s.
Error         $E_{yy}$    1      $E_{xy}^2/E_{xx}$    $f_e - 1$        $E_{yy} - E_{xy}^2/E_{xx}$                     $s_e^2$
Sum           $S_{yy}$    1      $S_{xy}^2/S_{xx}$    $t + f_e - 2$    $S_{yy} - S_{xy}^2/S_{xx}$
Treatments (by subtraction)                           $t - 1$          $T_{yy} - S_{xy}^2/S_{xx} + E_{xy}^2/E_{xx}$   $s_t^2$


The reduced s.s. for Treatments is obtained by subtracting the
deviations s.s. for Error from that for the Sum (last line of Table 2).

The items of information most commonly wanted from this analysis 
are obtained as follows. 

(i) The regression coefficient $\beta$. This is estimated from the Error
line; $b = E_{xy}/E_{xx}$. The estimated standard error of b is $s_e/\sqrt{E_{xx}}$,
with $(f_e - 1)$ d.f.

(ii) The adjusted estimate of a treatment effect. In the simplest
experimental designs, the unadjusted estimate for treatment i is simply
the mean $y_{i.}$ of all observations having this treatment. The adjusted
estimate is

$$y'_{i.} = y_{i.} - b(x_{i.} - x_{..}) \qquad (1)$$

(iii) The estimated standard error of any linear function
$L = \sum g_i y'_{i.}$ of the adjusted treatment means is

$$\text{s.e.}(L) = s_e \sqrt{\frac{\sum g_i^2}{r} + \frac{\{\sum g_i (x_{i.} - x_{..})\}^2}{E_{xx}}} \qquad (2)$$

with $(f_e - 1)$ d.f., where r is the number of replications. In particular,
the standard error of the difference between two adjusted treatment
means is, putting $g_i = 1$, $g_j = -1$, and all other g's = 0,

$$\text{s.e.}(y'_{i.} - y'_{j.}) = s_e \sqrt{\frac{2}{r} + \frac{(x_{i.} - x_{j.})^2}{E_{xx}}} \qquad (3)$$

(iv) For a test of the null hypothesis that all treatment effects are
equal, we compute $F = s_t^2/s_e^2$, where $s_t^2$ and $s_e^2$ are the mean squares
found in Table 2. This ratio has $(t - 1)$ and $(f_e - 1)$ d.f.
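The computations in items (i) to (iv) can be sketched in a few lines for a randomized block layout. This is an illustrative sketch under stated assumptions, not the author's program: the function name, the dictionary keys and the small data set are hypothetical, and the error d.f. are taken as $f_e = (t - 1)(r - 1)$, which holds for randomized blocks only.

```python
import numpy as np


def ancova_randomized_blocks(x, y):
    """Standard covariance computations; rows = treatments, columns = replications."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    t, r = y.shape
    f_e = (t - 1) * (r - 1)          # error d.f. before the covariance adjustment

    def treat_sp(a, c):
        # Treatments sum of products: r * sum_i (a_i. - a_..)(c_i. - c_..)
        return r * np.sum((a.mean(axis=1) - a.mean()) * (c.mean(axis=1) - c.mean()))

    def resid(a):
        # Error component a_ij - a_i. - a_.j + a_..
        return (a - a.mean(axis=1, keepdims=True)
                  - a.mean(axis=0, keepdims=True) + a.mean())

    def error_sp(a, c):
        return np.sum(resid(a) * resid(c))

    # Table 1: sums of squares and products.
    T_xx, T_xy, T_yy = treat_sp(x, x), treat_sp(x, y), treat_sp(y, y)
    E_xx, E_xy, E_yy = error_sp(x, x), error_sp(x, y), error_sp(y, y)
    S_xx, S_xy, S_yy = T_xx + E_xx, T_xy + E_xy, T_yy + E_yy

    # Table 2: deviations from regression for Error and Sum lines.
    b = E_xy / E_xx
    err_dev = E_yy - E_xy**2 / E_xx
    sum_dev = S_yy - S_xy**2 / S_xx
    s2_e = err_dev / (f_e - 1)
    s2_t = (sum_dev - err_dev) / (t - 1)   # reduced treatments mean square

    return {
        "b": b,
        "se_b": np.sqrt(s2_e / E_xx),
        "adjusted_means": y.mean(axis=1) - b * (x.mean(axis=1) - x.mean()),
        "s2_e": s2_e,
        "F": s2_t / s2_e,
        "df": (t - 1, f_e - 1),
    }


# Hypothetical data: 3 treatments in 4 replications.
x = np.array([[3.1, 4.0, 2.7, 3.6],
              [2.9, 4.4, 3.0, 3.3],
              [3.5, 4.1, 2.5, 3.9]])
y = np.array([[10.2, 12.9, 9.1, 11.5],
              [11.8, 15.0, 12.1, 12.6],
              [9.9, 11.4, 8.0, 11.0]])
result = ancova_randomized_blocks(x, y)
for key, value in result.items():
    print(key, value)
```

The same b and F should come out of any general least squares routine fitted with treatment, replication and covariate terms, which gives a convenient independent check.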


4. NATURE OF THE COVARIANCE ADJUSTMENT 


The structure of the covariance adjustment, $-b(x_{i.} - x_{..})$, is in
accord with common sense: $(x_{i.} - x_{..})$ measures the amount by which
the x-value for this treatment exceeds the average x-value, while b
measures the change in y expected to accompany unit change in x.
In a specific application, the sizes and directions of the adjustments
are determined by the data. In this respect a covariance adjustment
differs markedly from the type of arbitrary adjustment that has sometimes
earned a dubious reputation for the whole process of adjustment.
It is not true, however, that a covariance adjustment is entirely
objective, since the investigator must choose the type of regression equation
(e.g. linear, quadratic, linear in log x) from which the adjustment is
derived. Moreover, the x-variables do not always measure what we
would like to think they measure. One occasionally meets extravagant
claims for a covariance adjustment, as for example that data have been
"adjusted to equalize socio-economic status," when what has actually
been done is to adjust for a linear regression on a crude social rating of
the father's occupation as reported by anyone who happens to be at
home when the interviewer calls.

From equation (3), the estimated variance of the difference between
two adjusted means, when averaged over all pairs of means, works out as

$$\frac{2 s_e^2}{r} \left[ 1 + \frac{t_{xx}}{E_{xx}} \right] \qquad (4)$$

where $t_{xx} = T_{xx}/(t - 1)$ is the treatments mean square for x. If t-tests
are to be made between several pairs of means, this expression may be
used, as an approximation, for the variance of the difference between
any pair (Finney [1946]). This saves the labor of computing a separate
standard error for each pair, as is required by the exact formula (3).
This device is not recommended when the treatments produce significant
effects on x, because the variance of the difference may be substantially
greater for some pairs than for others, so that the use of a single average
variance becomes unsatisfactory.

More generally, the quantity

$$s_e^2 \left[ 1 + \frac{t_{xx}}{E_{xx}} \right] \qquad (5)$$

may be regarded as the effective error variance per unit in a covariance
analysis, where the term in brackets is an allowance for sampling errors
in b. In a completed experiment, the gain in precision from covariance
can be estimated by comparing (5) with $s_y^2 = E_{yy}/f_e$, the error mean
square for y in its analysis of variance in Table 1. This comparison
ignores the loss of 1 d.f. from the error which occurs with a covariance
adjustment. The effect of this loss on the sensitivity of the t-tests is
small even with only 5 d.f. in error.
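As a rough numerical illustration of expression (5), the sketch below borrows the Treatments and Error sums of squares and products from Table 3 later in this paper, purely as arithmetic input. The function name is hypothetical, and the value t = 9 is inferred from the 8 treatment d.f. in that table.

```python
def effective_error_variance(E_xx, E_xy, E_yy, T_xx, t, f_e):
    """Expression (5): s_e^2 * (1 + t_xx / E_xx)."""
    s2_e = (E_yy - E_xy**2 / E_xx) / (f_e - 1)   # residual mean square, Table 2
    t_xx = T_xx / (t - 1)                         # treatments mean square for x
    return s2_e * (1.0 + t_xx / E_xx)


# Error line (36 d.f.) and Treatments x^2 entry taken from Table 3.
E_xx, E_xy, E_yy, f_e = 1214.0, 1893.0, 5447.0, 36
T_xx, t = 292.0, 9

effective = effective_error_variance(E_xx, E_xy, E_yy, T_xx, t, f_e)
s2_y = E_yy / f_e                                 # error mean square without covariance
gain = s2_y / effective                           # estimated relative precision
print(f"effective error variance = {effective:.2f}")
print(f"unadjusted error mean square = {s2_y:.2f}")
print(f"estimated gain in precision = {gain:.2f}")
```

A ratio above 1 indicates that covariance has paid for the one d.f. it costs; as the text notes, this comparison ignores that lost d.f.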


5. THEORY OF THE TECHNIQUE 


The theory for the simplest designs will be illustrated by the row ×
column classification, with treatments as rows and replications as
columns. The model is

$$y_{ij} = \mu + \tau_i + \rho_j + \beta(x_{ij} - x_{..}) + e_{ij} \qquad (6)$$

Following the method of least squares, the unknown parameters are
estimated by minimizing

$$\sum \{y_{ij} - m - t_i - r_j - b(x_{ij} - x_{..})\}^2 \qquad (7)$$

Since $t_i$ need measure only the difference between the effect of the $i$th
treatment and the general mean, we may assume

$$\sum t_i = 0; \qquad \sum r_j = 0.$$

The estimates. The algebra involved in finding the estimates can be
reduced by introducing the variable

$$x'_{ij} = x_{ij} - x_{i.} - x_{.j} + x_{..} \qquad (8)$$

Those familiar with the analysis of variance will recognize $x'_{ij}$ as the
contribution of the $(i, j)$th observation to the error of x in its analysis
of variance. The properties of $x'_{ij}$ that will be useful are

$$\sum_i x'_{ij} = 0; \qquad \sum_j x'_{ij} = 0 \qquad (9)$$

$$\sum x'^2_{ij} = E_{xx}; \qquad \sum x'_{ij} y_{ij} = E_{xy} \qquad (10)$$

The second relation in (10) is less familiar than the others, but is easily
verified.
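Relations (9) and (10) can be confirmed numerically on arbitrary data. The sketch below uses randomly generated values (a hypothetical 4 × 5 layout with a fixed seed) and is intended only as a check of the algebra.

```python
import numpy as np

rng = np.random.default_rng(0)
t, r = 4, 5                                   # hypothetical layout
x = rng.normal(size=(t, r))
y = rng.normal(size=(t, r))


def error_part(a):
    """a_ij - a_i. - a_.j + a_.. as in equation (8)."""
    return (a - a.mean(axis=1, keepdims=True)
              - a.mean(axis=0, keepdims=True) + a.mean())


x_prime = error_part(x)
E_xx = np.sum(x_prime**2)                     # error s.s. of x
E_xy = np.sum(x_prime * error_part(y))        # error s.p. of x and y

# Relations (9): x' sums to zero over treatments and over replications.
print(np.allclose(x_prime.sum(axis=0), 0.0), np.allclose(x_prime.sum(axis=1), 0.0))
# Second relation in (10): multiplying x' by the raw y already gives E_xy.
print(np.isclose(np.sum(x_prime * y), E_xy))
```

The second relation holds because the row, column and general mean parts of y are annihilated by the zero sums in (9).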


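Now the given prediction equation

$$\hat{y}_{ij} = m + t_i + r_j + b(x_{ij} - x_{..})$$

becomes identical with the prediction equation

$$\hat{y}_{ij} = m' + t'_i + r'_j + b x'_{ij}$$

if the new estimates $m'$, $t'_i$, $r'_j$ satisfy the relations

$$m' = m; \qquad t'_i = t_i + b(x_{i.} - x_{..}); \qquad r'_j = r_j + b(x_{.j} - x_{..}) \qquad (11)$$

This may be verified by substitution. Further, since $\sum t_i = \sum r_j = 0$,
it follows that $\sum t'_i = \sum r'_j = 0$.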

Hence, instead of finding $m$, $t_i$, $r_j$ and $b$ so as to minimize (7), we
can find $m'$, $t'_i$, $r'_j$ and $b$ to minimize

$$\sum (y_{ij} - m' - t'_i - r'_j - b x'_{ij})^2 \qquad (12)$$

On expansion, (12) becomes

$$\sum (y_{ij} - m' - t'_i - r'_j)^2 - 2b \sum x'_{ij} y_{ij} + b^2 \sum x'^2_{ij}$$

Note that the omitted terms

$$2b \sum x'_{ij} (m' + t'_i + r'_j)$$

vanish because of relations (9).

Using (10), the quantity to be minimized is

$$\sum (y_{ij} - m' - t'_i - r'_j)^2 - 2b E_{xy} + b^2 E_{xx} \qquad (13)$$


The advantage of this result is that b is disentangled from the other
unknowns. Differentiation with respect to b gives

$$b = E_{xy}/E_{xx} \qquad (14)$$

The other unknowns, $m'$, $t'_i$, $r'_j$, must be chosen so as to minimize
the first term in (13). But this is exactly the minimization involved in
an ordinary analysis of variance of y without covariance. Hence, by
the standard results for the analysis of variance,

$$m' = y_{..}; \qquad t'_i = y_{i.} - y_{..}; \qquad r'_j = y_{.j} - y_{..}$$

Finally, from (11), the least squares estimates for the covariance
analysis are

$$t_i = y_{i.} - y_{..} - b(x_{i.} - x_{..}) \qquad (15)$$
$$r_j = y_{.j} - y_{..} - b(x_{.j} - x_{..})$$

Since the $t_i$ represent deviations from the overall mean, the estimate
used in practice is

$$m + t_i = y_{i.} - b(x_{i.} - x_{..}) = y'_{i.}, \text{ say.} \qquad (16)$$

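Standard errors. From (13), the residual sum of squares may be written

$$E_{yy} - 2bE_{xy} + b^2 E_{xx} = E_{yy} - E_{xy}^2/E_{xx} \qquad (17)$$

The d.f. are $(f_e - 1)$, since 1 d.f. is subtracted for the regression. Hence
$s_e^2$, the residual mean square, is computed as in Table 2.
To find the standard error of b, we may write

$$b = \frac{\sum x'_{ij} y_{ij}}{E_{xx}} = \frac{1}{E_{xx}} \sum x'_{ij} \{\mu + \tau_i + \rho_j + \beta(x_{ij} - x_{..}) + e_{ij}\}$$

From the properties of the $x'_{ij}$, this reduces to

$$b = \beta + \frac{\sum x'_{ij} e_{ij}}{E_{xx}}$$

This equation expresses $(b - \beta)$ as a linear function of the random
residuals $e_{ij}$. Hence, writing $\sigma^2$ for the common variance of the $e_{ij}$,

$$V(b) = \frac{\sigma^2 \sum x'^2_{ij}}{E_{xx}^2} = \frac{\sigma^2}{E_{xx}}$$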
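In the same way we find, for an adjusted treatment mean,

$$y'_{i.} = \mu + \tau_i + e_{i.} - (x_{i.} - x_{..}) \frac{\sum x'_{ij} e_{ij}}{E_{xx}}$$

For a linear comparison between the adjusted means, it follows that

$$L = \sum g_i y'_{i.} = \sum g_i(\mu + \tau_i) + \sum g_i e_{i.} - \frac{\sum x'_{ij} e_{ij}}{E_{xx}} \left\{ \sum g_i (x_{i.} - x_{..}) \right\}$$

By the properties of the $x'_{ij}$, the variate $\sum x'_{ij} e_{ij}$ is uncorrelated with
any of the means $e_{i.}$. This gives

$$V(L) = \sigma^2 \left[ \frac{\sum g_i^2}{r} + \frac{\{\sum g_i (x_{i.} - x_{..})\}^2}{E_{xx}} \right]$$

in agreement with the result in equation (2) of section 3.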
Note: The variate $x'_{ij}$ can be used in the same way with all other
designs (e.g. latin squares, incomplete blocks) to which the standard
covariance theory applies. With an incomplete block design, for
example, we define $x'_{ij}$ as

$$x'_{ij} = x_{ij} - m_x - \hat{t}_i - \hat{r}_j$$

where $m_x$ is the estimate of the general mean and $\hat{t}_i$, $\hat{r}_j$ are the
estimates of the treatment and block effects, respectively, in the
incomplete block analysis of variance of x. The
rest of the algebraic development goes through without change.


Tests of significance. For the F-test, we quote the general theorem on
the F-test in regression analysis, as applied to this problem. Let

$$D = \sum \{y_{ij} - m - t_i - r_j - b(x_{ij} - x_{..})\}^2$$
$$D' = \sum \{y_{ij} - \tilde{m} - \tilde{r}_j - \tilde{b}(x_{ij} - x_{..})\}^2$$

where the constants are chosen in each case so as to minimize the
corresponding sum of squares of deviations. Then if all $\tau_i$ are zero, the
quantity

$$\frac{(D' - D)/(t - 1)}{D/(f_e - 1)}$$

is distributed as F with $(t - 1)$ and $(f_e - 1)$ d.f. For proofs see, e.g.,
Yates [1938], Anderson and Bancroft [1952].

From (17) the denominator $D/(f_e - 1)$ has been shown to be $s_e^2$.
By the same approach, $D'$ is the sum of squares of deviations from the
"error" regression when only replication effects are eliminated in the
analysis of variance. But in that event the "error" will be equal to
"treatments + error" in the present analysis. Hence the numerator
of F, $(D' - D)/(t - 1)$, is equal to $s_t^2$ as defined in Table 2.


6. THE REDUCED SUM OF SQUARES FOR TREATMENTS 


The non-mathematical user of the analysis of covariance often 
finds it easy to rationalize the method by which b is computed and the 
formula for making the adjustments. The roundabout method by 
which the numerator of F is computed is, however, apt to appear 
mysterious. In this section the sum of squares $D' - D$ in the numerator
of F is examined in more detail. Following Fisher, this quantity will
be called the reduced sum of squares for treatments, $T_{yyR}$.

Intuitively, one might expect this quantity to be a squared
comparison among the adjusted treatment means

$$y'_{i.} = y_{i.} - b(x_{i.} - x_{..})$$

As is well-known, $T_{yyR}$ is not the sum of squares of deviations of
these adjusted treatment means. The latter sum of squares, $T_{yyA}$ say, is

$$T_{yyA} = r \sum \{(y_{i.} - y_{..}) - b(x_{i.} - x_{..})\}^2$$
$$= T_{yy} - 2bT_{xy} + b^2 T_{xx} \qquad (18)$$
$$= T_{yy} - 2b b_t T_{xx} + b^2 T_{xx} \qquad (19)$$


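where $b_t = T_{xy}/T_{xx}$ is the regression coefficient as computed from the
Treatments line in the analysis of variance in Table 1. From Table 2,
on the other hand, the sum of squares $T_{yyR}$ in the numerator of F is

$$T_{yyR} = T_{yy} - \frac{S_{xy}^2}{S_{xx}} + \frac{E_{xy}^2}{E_{xx}} \qquad (20)$$
$$= T_{yy} - \frac{(b_t T_{xx} + b E_{xx})^2}{T_{xx} + E_{xx}} + b^2 E_{xx} \qquad (21)$$

Subtracting (19) from (21) and taking the common denominator
$(T_{xx} + E_{xx})$ we find

$$T_{yyR} = T_{yyA} - \frac{(b_t - b)^2 T_{xx}^2}{T_{xx} + E_{xx}} \qquad (22)$$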

This result was first given by Yates [1934]. Equation (22) remains
valid if $T_{yyR}$ and $T_{yyA}$ are the reduced and adjusted sums of squares
for any specific component of the treatments sum of squares, provided
that $b_t$ and $T_{xx}$ refer to this component. As Yates showed, the result
provides an approximate short-cut method of making F-tests when we
intend to test several components. The exact test involves going
through the procedure in Table 2 separately for each component. We
can, however, easily calculate $T_{yyA}$ for each component by relation (18).
The F-values computed from $T_{yyA}$ will all be too large, but the
approximation is usually close if the treatments do not affect x, and only a
few borderline F's need be recomputed by the exact method. Table 3
compares the approximate and exact F's in an experiment in which the
treatments sum of squares was divided into 4 components (data from
Cochran and Cox [1957], s.s. rounded to 100's).


TABLE 3
THE APPROXIMATE F-TEST FROM THE ANALYSIS OF $(y - bx)^2$

                      Sums of squares and products
              D.f.   $(x^2)$   $(xy)$   $(y^2)$   $(y - bx)^2$    $F_a$     F     $F_{.05}$
Treatments      8      292      -92      1574        2571         4.51    4.16     2.22
  Ave. L        1       31     -133       572        1062        14.90   14.53     4.12
  Ave. Q        1       22       83       311         106         1.49    1.46     4.12
  Dev. L        3      230      -68       434        1205         5.63    5.05     2.87
  Dev. Q        3        9       26       257         198         0.93    0.93     2.87
Error          36     1214     1893      5447        2495


The columns on the right show the approximate ratio $F_a$, the exact
F as found from the method in Table 2, and the tabular 5% value.
Since none of the $F_a$ values is near the 5% level, no recomputation
would be necessary in practice in this example. (The column of mean
squares of $(y - bx)^2$ has been omitted.)
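The two F columns of Table 3 can be recomputed from the tabulated sums of squares and products. The sketch below is an illustrative recomputation, with hypothetical names (`ERROR`, `COMPONENTS`, `f_ratios`); the approximate ratio uses relation (18) and the exact ratio uses the subtraction procedure of Table 2, both divided by the residual mean square on 35 d.f.

```python
# Error line and treatment components from Table 3: (d.f., x^2, xy, y^2).
ERROR = (36, 1214.0, 1893.0, 5447.0)
COMPONENTS = {
    "Treatments": (8, 292.0, -92.0, 1574.0),
    "Ave. L": (1, 31.0, -133.0, 572.0),
    "Ave. Q": (1, 22.0, 83.0, 311.0),
    "Dev. L": (3, 230.0, -68.0, 434.0),
    "Dev. Q": (3, 9.0, 26.0, 257.0),
}


def f_ratios(component, error=ERROR):
    """Return (approximate F_a, exact F) for one treatment component."""
    df, T_xx, T_xy, T_yy = component
    f_e, E_xx, E_xy, E_yy = error
    b = E_xy / E_xx                                  # error regression coefficient
    err_dev = E_yy - E_xy**2 / E_xx                  # deviations s.s. for Error
    s2_e = err_dev / (f_e - 1)
    T_adj = T_yy - 2.0 * b * T_xy + b**2 * T_xx      # relation (18)
    S_xx, S_xy, S_yy = T_xx + E_xx, T_xy + E_xy, T_yy + E_yy
    T_red = (S_yy - S_xy**2 / S_xx) - err_dev        # reduced s.s., Table 2
    return T_adj / df / s2_e, T_red / df / s2_e


for name, comp in COMPONENTS.items():
    f_a, f_exact = f_ratios(comp)
    print(f"{name:<11s} F_a = {f_a:6.2f}   F = {f_exact:6.2f}")
```

Because the tabulated sums of squares are rounded, agreement with the printed ratios should be expected only to about the second decimal place.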

The presence of sampling errors in b explains why $T_{yyA}$ is not the
correct sum of squares for the numerator of F. If the true $\beta$ could be
used in $T_{yyA}$, this sum of squares would give an exact F-test. The
sampling error of b complicates the issue in two ways. Each adjusted
treatment mean has a different variance, since

$$V(y'_{i.}) = \sigma^2 \left\{ \frac{1}{r} + \frac{(x_{i.} - x_{..})^2}{E_{xx}} \right\} \qquad (23)$$

Further, the adjusted means for the ith and jth treatments are not
independent, since b enters into both. Their covariance is

$$\text{cov}(y'_{i.}, y'_{j.}) = \sigma^2 \frac{(x_{i.} - x_{..})(x_{j.} - x_{..})}{E_{xx}} \qquad (24)$$

Consequently, as would be expected from theory, it turns out that
the correct numerator $T_{yyR}$ is a quadratic form in the adjusted means,

$$T_{yyR} = \sum_i \sum_j a_{ij} (y'_{i.} - y_{..})(y'_{j.} - y_{..})$$
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where the matrix a;; is the inverse of the variance-covariance matrix 
of the y;; (apart from the factor o?). To find the a,; , we may write, 
from (22), 


7, — 


Now 
=r Vi. ){y. —y..) — 
=r — 2. yi. y.) 
Substituting this expression in (25) we find that 
2 2 
It follows that 
— 
ie + E,, (26) 
2 — 


The inverse of the matrix (a,;,) can be shown to be the covariance 
matrix of the y/ as given by equations (23) and (24). 
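The inverse relation between the coefficients of (26) and the covariance matrix of (23) and (24) can be checked numerically. The sketch below is our own illustration: the treatment means of x, the number of replications and the error sum of squares are hypothetical values, and the formulas coded are our reading of equations (23), (24) and (26), with the factor $\sigma^2$ omitted.

```python
def deviations(x_means):
    """Deviations of the treatment means of x from their overall mean."""
    xbar = sum(x_means) / len(x_means)
    return [x - xbar for x in x_means]


def covariance_of_adjusted_means(x_means, r, exx):
    """Covariance matrix of the adjusted means, eqs. (23)-(24), apart from sigma^2."""
    u = deviations(x_means)
    t = len(u)
    return [[(1.0 / r if i == j else 0.0) + u[i] * u[j] / exx
             for j in range(t)] for i in range(t)]


def quadratic_form_coefficients(x_means, r, exx):
    """Coefficients a_ij of the quadratic form, eq. (26)."""
    u = deviations(x_means)
    t = len(u)
    txx = r * sum(d * d for d in u)     # treatments s.s. for x
    return [[(float(r) if i == j else 0.0) - r * r * u[i] * u[j] / (txx + exx)
             for j in range(t)] for i in range(t)]


def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


# Hypothetical inputs, chosen only for illustration.
X_MEANS = [10.0, 12.5, 9.0, 14.0]
R = 5
EXX = 40.0

product = matmul(quadratic_form_coefficients(X_MEANS, R, EXX),
                 covariance_of_adjusted_means(X_MEANS, R, EXX))
for row in product:
    print("  ".join(f"{value:8.5f}" for value in row))
```

The printed product should be the identity matrix up to floating-point error, whatever values are chosen for the inputs.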

It is also instructive to compare $T_{yyA}$ and $T_{yyR}$ with the sum of
squares of deviations of the treatment means $y_{i.}$ from the treatments
regression on x, i.e. with the treatments sum of squares of $(y - b_t x)$.
This sum of squares is

$$T_{yy \cdot t} = T_{yy} - b_t^2 T_{xx} \qquad (27)$$

and has (t - 2) d.f. From (19),

$$T_{yyA} = T_{yy} - 2 b b_t T_{xx} + b^2 T_{xx} = T_{yy \cdot t} + (b_t - b)^2 T_{xx} \qquad (28)$$

using (27). Finally from (22),

$$T_{yyR} = T_{yy \cdot t} + \frac{(b_t - b)^2 T_{xx} E_{xx}}{T_{xx} + E_{xx}} \qquad (29)$$

Equation (29) shows that the reduced sum of squares for treatments
separates into two parts: (i) the sum of squares $T_{yy \cdot t}$, with (t - 2)
d.f., which represents the deviations of the treatment means from their
own regression, (ii) a single d.f. which compares the regression coefficients
from the treatments and error lines in the analysis of variance. It
may be shown that if there are no treatment effects, this sum of squares
is distributed as $\chi^2 \sigma^2$ with 1 d.f. In this way, equation (29) can be
used to prove directly that, on the null hypothesis, $T_{yyR}$ is distributed
as $\chi^2 \sigma^2$ with (t - 1) d.f. In the same vein, equation (28) shows that
$T_{yyA}$ consists of (t - 2) d.f. that are distributed as $\chi^2 \sigma^2$. The remaining
1 d.f. is inflated by the factor $(T_{xx} + E_{xx})/E_{xx}$, as is seen by comparing
(28) with (29).
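The partition described by (27) to (29) can be illustrated with the Treatments and Error lines of Table 3. The sketch below is ours and assumes the forms $T_{yyA} = T_{yy \cdot t} + (b_t - b)^2 T_{xx}$ and $T_{yyR} = T_{yy \cdot t} + (b_t - b)^2 T_{xx} E_{xx}/(T_{xx} + E_{xx})$; the function name `partition` is hypothetical.

```python
def partition(txx, txy, tyy, exx, exy):
    """Split the treatments s.s. into deviations from the treatments regression
    plus a single d.f. comparing the treatments and error regressions."""
    b = exy / exx                          # error regression coefficient
    b_t = txy / txx                        # treatments regression coefficient
    dev = tyy - b_t ** 2 * txx             # eq. (27), (t - 2) d.f.
    single_approx = (b_t - b) ** 2 * txx   # 1 d.f. term in eq. (28)
    single_exact = single_approx * exx / (txx + exx)   # 1 d.f. term in eq. (29)
    return dev, dev + single_approx, dev + single_exact


# Treatments and Error lines of Table 3.
TXX, TXY, TYY = 292.0, -92.0, 1574.0
EXX, EXY, EYY = 1214.0, 1893.0, 5447.0

dev, t_yy_a, t_yy_r = partition(TXX, TXY, TYY, EXX, EXY)

# Direct computation of the reduced s.s. for comparison with eq. (29).
direct = (TYY + EYY) - (TXY + EXY) ** 2 / (TXX + EXX) - (EYY - EXY ** 2 / EXX)

print(f"deviations from treatments regression: {dev:8.1f}")
print(f"T_yyA: {t_yy_a:8.1f}   T_yyR: {t_yy_r:8.1f}   direct T_yyR: {direct:8.1f}")
print(f"inflation of the single d.f.: {(t_yy_a - dev) / (t_yy_r - dev):.4f}")
```

The last line printed should equal $(T_{xx} + E_{xx})/E_{xx}$, the inflation factor noted in the text.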

To sum up, the sum of squares $T_{yyR}$ in the numerator of the F-test
is a quadratic form in the deviations $(y'_{i.} - y_{..})$ of the adjusted treatment
means. The coefficients $a_{ij}$ in this quadratic form are the inverse of the
covariance matrix of the quantities $y'_{i.}$. As would be expected, $T_{yyR}$
vanishes if all adjusted means are equal. Because of the presence of
sampling errors in b, $T_{yyR}$ does not equal the usual sum of squares of
deviations $T_{yyA}$ of the adjusted treatment means. $T_{yyA}$ always gives
too large a sum of squares, but if treatments do not affect x it is a
useful approximation to $T_{yyR}$ when several components of the treatments
sum of squares are to be tested.


7. ASSUMPTIONS REQUIRED FOR THE ANALYSIS OF COVARIANCE 


The assumptions required for valid use of the analysis of covariance
are the natural extension of those for an analysis of variance, namely,

(i) Treatment, block and regression effects must be additive as
postulated by the model,

(ii) The residuals $e_{ij}$ must be normally and independently distributed
with zero means and the same variance.

Although the effects of failures in these assumptions on the analysis of
covariance as such do not appear to have been investigated, much of
the related work on the analysis of variance carries over, for instance
that on the effects of non-normality or inhomogeneity of variance in
the $e_{ij}$. The general precautions that have been given about the practical
use of the analysis of variance should equally be observed in an
analysis of covariance, see e.g. Cochran [1947].

Two assumptions that particularly involve the regression term in
covariance should be noted. First, the treatment and the regression effects
may not be additive. Bartlett [1937] pointed out that this danger
might be present in the cotton-salt example already cited. On plots
with a high salt content, the crop might be unable to respond to superior
fertilizers. Thus, in an extreme case, the treatment effects may be
zero if x lies above a certain value. If this happens, the covariance
adjustment may still improve the precision, but (i) the meaning of the
adjusted treatment effects becomes cloudy, and (ii) if covariance is
applied in a routine way, the investigator fails to discover the differential
nature of the treatment effects, a point that might be important for
practical applications.

Secondly, the covariance procedure assumes that the correct form
of regression equation has been fitted. Perhaps the most common error
to be anticipated is that linear regressions will be used when the true
regression is curvilinear. In a randomized experiment in which treatments
do not affect x, the randomization ensures that the usual interpretations
of standard errors and tests of significance are not seriously
vitiated, although fitting the correct form of regression would presumably
give a larger increase in precision. The danger of misleading
results is greater when x shows real differences from treatment to treatment.
Later investigations by Fairfield Smith in this issue suggest,
however, that this disturbance is serious only in rather extreme situations.


8. MULTIPLE COVARIANCE 


No new difficulty is presented when there is more than one covariate,
although the computations become more lengthy. Only the basic
formulae will be given. With two covariates, x and z, the regression
coefficients are obtained from the equations

$$E_{xx} b_{yx} + E_{xz} b_{yz} = E_{xy}$$
$$E_{xz} b_{yx} + E_{zz} b_{yz} = E_{zy}$$

If t-tests of the adjusted treatment means are wanted, it is advisable
to compute the inverse of the matrix of error sums of squares and products
of x and z, say $(c_{ij})$, with elements $c_{xx}$, $c_{xz}$, $c_{zz}$.
In the simple orthogonal designs like randomized blocks, the adjusted
mean of the ith treatment is estimated by

$$y'_{i.} = y_{i.} - b_{yx}(x_{i.} - x_{..}) - b_{yz}(z_{i.} - z_{..})$$

The variance of the difference between the ith and jth treatment effects
is

$$\sigma^2 \{ 2/r + c_{xx}(x_{i.} - x_{j.})^2 + 2c_{xz}(x_{i.} - x_{j.})(z_{i.} - z_{j.}) + c_{zz}(z_{i.} - z_{j.})^2 \}$$

The average variance over all pairs of treatments is

$$s^2_{diff} = \frac{2s^2}{r} \{ 1 + c_{xx}t_{xx} + 2c_{xz}t_{xz} + c_{zz}t_{zz} \}$$

where $t_{xx}$ is the treatments mean square for x, etc. This is the extension
of equation (4). If treatments do not affect x or z, the square root,
$s_{diff}$, may be used as an approximate standard error of the difference
for all t-tests.

For the F-test, the regression equations from the error line and the
(Treatments + Error) line must both be solved. If treatments do not
affect x or z, an alternative is to begin with an approximate F-test based
on the analysis of variance of $(y - b_{yx}x - b_{yz}z)$. It will be necessary
to compute the exact F-value only in doubtful cases.
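The computations for two covariates can be sketched as follows. This is our own illustration with hypothetical error sums of squares and products. The function names are not from the paper, and the variance formula coded is the standard one for a difference of two adjusted means, which we assume is what the text intends.

```python
def solve_two_covariates(exx, exz, ezz, exy, ezy):
    """Solve the two normal equations for (b_yx, b_yz) and return them with the
    elements (c_xx, c_xz, c_zz) of the inverse of the error matrix."""
    det = exx * ezz - exz * exz
    c_xx, c_xz, c_zz = ezz / det, -exz / det, exx / det
    b_yx = c_xx * exy + c_xz * ezy
    b_yz = c_xz * exy + c_zz * ezy
    return (b_yx, b_yz), (c_xx, c_xz, c_zz)


def variance_of_difference(s2, r, c, dx, dz):
    """Estimated variance of the difference between two adjusted treatment means,
    where dx and dz are the differences of their x and z means."""
    c_xx, c_xz, c_zz = c
    return s2 * (2.0 / r + c_xx * dx ** 2 + 2.0 * c_xz * dx * dz + c_zz * dz ** 2)


# Hypothetical error-line sums of squares and products.
EXX, EXZ, EZZ = 50.0, 10.0, 40.0
EXY, EZY = 30.0, 20.0

(b_yx, b_yz), c = solve_two_covariates(EXX, EXZ, EZZ, EXY, EZY)
print(f"b_yx = {b_yx:.4f}, b_yz = {b_yz:.4f}")
print(f"variance of a difference: {variance_of_difference(4.0, 5, c, 1.5, -0.5):.4f}")
```

When the two treatments have identical covariate means (dx = dz = 0), the variance reduces to the unadjusted value 2s²/r, as expected.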


9. MORE COMPLEX CLASSIFICATIONS 


Three cases that have received attention in the literature are hier- 
archical classifications, as represented by the split-plot design, incom- 
plete block designs with recovery of interblock information and two-way 
classifications with unequal numbers in the cells. 

In the split-plot design, two independent regression coefficients can
be computed, $b_w$ from the whole-plot error line and $b_s$ from the sub-plot
error line in the analysis of variance. If the two regression coefficients
appear to differ, $b_w$ can be used to adjust whole-plot comparisons and
$b_s$ to adjust sub-plot comparisons. The structure of the analysis of
variance is given by Kempthorne [1952]. If the suffix i denotes the
whole-plot treatments and j the sub-plot treatments, the adjusted
means in the two-way table of treatment means can be computed as
follows.

$$y'_{ij.} = y_{ij.} - b_w(x_{i..} - x_{...}) - b_s(x_{ij.} - x_{i..}) \qquad (30)$$

Although equation (30) looks as if a bivariate regression is being
used to make the adjustments, comparisons between whole-plot treatment
means $y'_{i..}$ are actually adjusted by $b_w$ alone, since

$$y'_{i..} = y_{i..} - b_w(x_{i..} - x_{...})$$

This form of adjustment avoids a discomforting feature mentioned by
Bartlett [1937] and Truett and Fairfield Smith [1956], namely that
individual adjusted means may not average to the appropriate whole-plot
treatment means. Similarly, it is easy to verify that comparisons
between sub-plot treatments and components of the interaction between
whole-plot and sub-plot treatments are adjusted by $b_s$ alone. Both
regression coefficients enter, however, into certain particular comparisons.
For instance, to compare two whole-plot treatments for the same
sub-plot treatment, we take, say,

$$y'_{21.} - y'_{11.} = y_{21.} - y_{11.} - b_w(x_{2..} - x_{1..}) - b_s(x_{21.} - x_{11.} - x_{2..} + x_{1..}) \qquad (31)$$

As in the split-plot design without covariance, standard errors of comparisons
like this require special investigation. For (31), the estimated
variance works out as


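$$\frac{2[s_w^2 + (\gamma - 1)s_s^2]}{r\gamma} + \frac{s_w^2 (x_{2..} - x_{1..})^2}{E_{w \cdot xx}} + \frac{s_s^2 (x_{21.} - x_{11.} - x_{2..} + x_{1..})^2}{E_{s \cdot xx}}$$

where $\gamma$ is the number of sub-plot treatments, $s_w^2$, $s_s^2$ are the whole- and
sub-plot error mean squares, and $E_{w \cdot xx}$, $E_{s \cdot xx}$ are the whole- and sub-plot
error sums of squares for x.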
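The split-plot adjustment can be illustrated numerically. The sketch below is ours: it assumes that equation (30) has the form $y'_{ij.} = y_{ij.} - b_w(x_{i..} - x_{...}) - b_s(x_{ij.} - x_{i..})$, where the subscripts w and s (our labels) mark the whole-plot and sub-plot regression coefficients, and it uses hypothetical cell means and coefficients.

```python
def adjusted_cell_means(y, x, b_w, b_s):
    """Adjust the cell means y[i][j] (whole-plot treatment i, sub-plot treatment j)
    using the whole-plot coefficient b_w and the sub-plot coefficient b_s."""
    n_whole, n_sub = len(x), len(x[0])
    x_whole = [sum(row) / n_sub for row in x]      # whole-plot treatment means of x
    x_grand = sum(x_whole) / n_whole               # general mean of x
    return [[y[i][j] - b_w * (x_whole[i] - x_grand) - b_s * (x[i][j] - x_whole[i])
             for j in range(n_sub)] for i in range(n_whole)]


# Hypothetical two-way tables of treatment means (2 whole-plot x 3 sub-plot).
Y = [[20.0, 24.0, 22.0], [26.0, 27.0, 31.0]]
X = [[5.0, 7.0, 6.0], [9.0, 8.0, 10.0]]
B_W, B_S = 0.8, 1.5

adj = adjusted_cell_means(Y, X, B_W, B_S)

# Whole-plot means of the adjusted table depend on b_w alone.
row_means = [sum(row) / len(row) for row in adj]
x_whole = [sum(row) / len(row) for row in X]
x_grand = sum(x_whole) / len(x_whole)
by_b_w_alone = [sum(Y[i]) / len(Y[i]) - B_W * (x_whole[i] - x_grand) for i in range(2)]
print("adjusted whole-plot means:", [round(m, 3) for m in row_means])
print("using b_w alone:          ", [round(m, 3) for m in by_b_w_alone])

# Comparison (31): two whole-plot treatments at the same sub-plot treatment.
lhs = adj[1][0] - adj[0][0]
rhs = (Y[1][0] - Y[0][0] - B_W * (x_whole[1] - x_whole[0])
       - B_S * (X[1][0] - X[0][0] - x_whole[1] + x_whole[0]))
print("comparison (31):", round(lhs, 3), round(rhs, 3))
```

The first two printed lines should agree, showing that the individual adjusted means average to the whole-plot means adjusted by $b_w$ alone.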

If the two regression coefficients can be assumed to be the same, a
good working procedure, although not fully efficient, is to adjust all
means by the sub-plot coefficient $b_s$. This procedure, originally suggested
by Bartlett [1937], has also been recommended after further
examination by Truett and Fairfield Smith [1956].

Similar issues arise with incomplete block designs, since separate
regressions can be calculated from the Blocks and Intra-block error
lines in the analysis of variance. The computations for recovery of
inter-block information, using the intra-block error regression to adjust
all means, are illustrated for the 9 × 9 triple lattice by Cox and Eckhardt
[1940] and for the 7 × 8 simple rectangular lattice by Robinson
and Watson [1949]. The approximations involved in this method
need further investigation, particularly for the smaller designs.

Methods for handling covariance analyses in the simpler non-orthogonal
multiple classifications are given by Wilks [1938], Hazel
[1946] and Das [1953], while Federer [1955] presents methods for lattice
squares and changeover designs in which residual effects are to be


INTERPRETATION OF ADJUSTED TREATMENT MEANS
AND REGRESSIONS IN ANALYSIS OF COVARIANCE*


H. Fairfield Smith


1. INTRODUCTION


Statistical analysis with several variates or variables has become 
loosely classified into multivariate analysis, regression analysis and 
analysis of covariance. Most workers have more or less definite ideas 
about what is implied by each term, but few would agree with any 
specific definition of their boundaries. Most multivariate work is in 
one sense or another covariance analysis; but, since its introduction by 
Fisher [1934] as an adjunct to analysis of variance, the term "analysis
of covariance" has had a special connotation, although the restriction
may not always be maintained.

Analysis of variance has itself acquired multiple connotations. In 
line with the comment by Eisenhart [1947] its etymological sense 
might be taken to imply detection and estimation of components of 
random variation associated with a composite population—now often 
referred to as variance component analysis. In a wider sense some have 
thought it might more accurately have been described as analysis of a 
sum of squares, particularly when the model specifies only one variance 
derived from random sources, and the parts of the analysis are compounds 
of this with constants by which class means are supposed to differ. 
In this form it is primarily an algorithm for tests of significance for 
estimates of certain constants, estimands of location. The associated 
model is conveniently called a regression model because the constants 
to be estimated can be formulated as regression coefficients. (See, 
for example, Anderson and Bancroft [1952]). When classifications
(treatments) are qualitative the estimands can be regarded as regression
coefficients associated with "dummy variables"; when treatments are
quantitative they may be formulated as regression coefficients in the
usual sense of average rates of change relative to continuous independent
variables. Analysis of covariance for regression estimation pertains
especially, although not exclusively, to an extension of the regression
model for analysis of variance.

*Part of a review of Analysis of Covariance sponsored by the Office of Ordnance Research, U. S.
Army, under Contract DA-36-034-ORD-1517; and contributed to a Symposium on Analysis of
Covariance at the Detroit meeting of the Biometric Society (ENAR), American Statistical Association
and Institute of Mathematical Statistics, September 7, 1956.

This paper considers two of the more important problems which
arise in applications of covariance analysis: the interpretation of adjusted
means, and the comparison of treatment ("external") and error ("internal")
regressions. Sums of squares and products in analysis of
variance and covariance of a dependent variate (y) and a concomitant
variable (x) are indicated as follows:


                         D.f.       Sums of Squares and Products
Between treatments     (t - 1)      $T_{xx}$    $T_{xy}$    $T_{yy}$
Error                  $\nu_E$      $E_{xx}$    $E_{xy}$    $E_{yy}$


The following notation will also be used in the discussion:


$b_T = T_{xy}/T_{xx}$ = treatment regression of y on x
$b_E = E_{xy}/E_{xx}$ = error regression of y on x
$T^* = T_{yy} - b_T T_{xy} = T_{yy} - Y_T^2$ = sum of squares for treatments
    adjusted by $b_T$
$E^* = E_{yy} - b_E E_{xy} = E_{yy} - Y_E^2$ = sum of squares for error adjusted
    by $b_E$
$Y_T^2 = T_{xx} b_T^2$ = square due to treatment ("external") regression
$Y_E^2 = E_{xx} b_E^2$ = square due to error ("internal") regression
$Y_1^2$ = square due to regression for pooled treatment and error
$Y_2^2$ = square due to "heterogeneity" of treatment and error regressions
$Y_3, \cdots, Y_t$ = (t - 2) other linear functions of y such
    that $\sum_{i=3}^{t} Y_i^2 = T^*$ = sum of squares for deviations of treatment
    means from treatment regression, and $\sum_{i=2}^{t} Y_i^2$ =
    reduced treatment sum of squares.
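The relations among these squares can be checked with a short sketch. This is our own illustration. It assumes the usual forms for the pooled-regression square, $(T_{xy} + E_{xy})^2/(T_{xx} + E_{xx})$, and for the heterogeneity square, $(b_T - b_E)^2/(1/T_{xx} + 1/E_{xx})$, neither of which is written out in the notation above. The sums of squares and products are hypothetical.

```python
def covariance_squares(txx, txy, tyy, exx, exy, eyy):
    """Return the squares and adjusted sums of squares defined in the notation."""
    b_t = txy / txx                          # treatment ("external") regression
    b_e = exy / exx                          # error ("internal") regression
    y_t2 = txx * b_t ** 2                    # square due to treatment regression
    y_e2 = exx * b_e ** 2                    # square due to error regression
    pooled = (txy + exy) ** 2 / (txx + exx)  # assumed form of the pooled square
    hetero = (b_t - b_e) ** 2 / (1.0 / txx + 1.0 / exx)  # assumed heterogeneity square
    t_star = tyy - y_t2                      # treatments adjusted by b_T
    e_star = eyy - y_e2                      # error adjusted by b_E
    reduced = (tyy + eyy - pooled) - e_star  # reduced treatment sum of squares
    return {"Y_T2": y_t2, "Y_E2": y_e2, "pooled": pooled, "hetero": hetero,
            "T_star": t_star, "E_star": e_star, "reduced": reduced}


# Hypothetical sums of squares and products.
sq = covariance_squares(txx=40.0, txy=30.0, tyy=100.0,
                        exx=160.0, exy=80.0, eyy=200.0)
for name, value in sq.items():
    print(f"{name:>8s} = {value:8.3f}")
```

Under these assumed forms the two regression squares repartition as pooled plus heterogeneity, and the reduced treatment sum of squares equals $T^*$ plus the heterogeneity square, which is the content of the last line of the notation.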
We assume the usual model,

$$y_{ij} = \mu + \tau_i + \beta x_{ij} + e_{ij},$$

where $y_{ij}$ and $x_{ij}$ are the jth observations on the ith treatment of the
dependent variate and concomitant variable, respectively, where the
general mean $x_{..}$ is zero, and where the $e_{ij}$ are independent, random
errors with zero means. Normality of the $e_{ij}$ is essential only for
significance testing and interval estimation. The notation $x_{i.}$ and $y_{i.}$
will be used to indicate treatment means. In general we deal with the
case of equal numbers of observations per treatment. The properties of
the $x_{ij}$ are not specified above, and this paper is concerned primarily
with necessary properties and with what happens if these properties do
not hold.


2. THE BASIS FOR USING COVARIANCE TO REDUCE VARIATION 


A postulate underlying the application of analysis of covariance to 
reduce the effect of extraneous variation on estimated treatment responses 
is that the concomitant variable, x, is unaffected by treatments, either
by direct causation or through correlation with another affected char- 
acter. Given that postulate, means can be adjusted to give more 
precise estimates of treatment effects than if analysis of covariance 
were not used and interpretation in the usual way seems unambiguous. 
The first part of this paper will be mainly concerned with the very 
different conditions when the postulate may fail. But as a preliminary 
we will note two matters which are frequently assumed to have a 
meaning which they do not necessarily have. 

Assuming the foregoing postulate to be true, the formulation of
orthogonal linear functions of the variate specified in section 1 is a purely
formal analysis for the purpose of proving in an elementary way the
properties of the partitions of the analysis of variance. All that is
required for this purpose is to define (t - 1) independent linear functions
which contain all the information about treatment responses unconfounded
with other parameters, in particular $\beta$, and having homogeneous
error variance. The easiest way to do this is to formulate (t - 2)
comparisons among observed means which are independent of $\beta$, and
a single remaining contrast, $Y_T$, in which alone confounding with $\beta$
has to be considered. This last is the regression of treatment means,
that is the regression of $y_{i.}$ on $x_{i.}$, which is

$$b_T = Y_T / \sqrt{T_{xx}} = \sum_i x_{i.} y_{i.} / \sum_i x_{i.}^2 = \sum_i x_{i.} \tau_i / \sum_i x_{i.}^2 + \beta + \sum_i x_{i.} e_{i.} / \sum_i x_{i.}^2 .$$


Variation of observed treatment means of the variate, $y_{i.}$, which can be
correlated with $x_{i.}$ being concentrated in this single contrast, decontamination
of the treatment contrasts is easily seen to be accomplished
by subtracting the independent estimator of $\beta$ which is available from
contrasts among experimental units within treatments. Thus the
information about $\tau_i$ which is latent in $Y_T$ is salvaged by

$$Y_2 = (b_T - b_E) / \sqrt{1/T_{xx} + 1/E_{xx}} = \left[ \sum_i x_{i.} \tau_i / \sum_i x_{i.}^2 \right] / \sqrt{1/T_{xx} + 1/E_{xx}}$$

+ terms in $e_{ij}$ with expectation zero.


The expression in brackets might be regarded as the regression of $\tau$ on x,
but it is not usefully so regarded because the postulate affirmed that $\tau$
and x are unrelated and therefore that on the average of many experiments
that regression must be zero. To test for a relationship which is
known not to exist is entirely pointless. The only role of $Y_2$ is a purely
mathematical one of making with $Y_3, \cdots, Y_t$ a complete set of (t - 1)
independent linear contrasts, with equal error variance, and containing
all available information on treatment responses. From these any other
linear contrast can be compounded, and its error variance indicated in a
theoretically elementary manner; but individual contrasts dictated by
the $x_{i.}$ which happen by chance to be associated with $y_{i.}$ in a given
experiment would rarely be of interest. In practice they are never
computed since the sum of their squares can be otherwise obtained,
and individual contrasts of special interest can be more directly computed
as the required contrast of observed means with a regression
adjustment.

When the postulate cannot be made, or if it may be in doubt, $Y_2$
may acquire some individual interest. But the possibility that $\tau_i$ is
associated with $x_{i.}$ by more than chance is not tested by comparing
$Y_2$ to internal error. This point will be discussed more fully in sections
7-10.

The concomitant variable is not necessarily a factor causally affecting
the variate. The second point, which seems seldom recognized, is that,
again assuming the postulate to be true, we need no assumption about
the causal relation between x and y. It suffices only that x be correlated
with something, often unknown, which causes extraneous variation of y.
For example, we might use covariance on amount of a weed present in
field plots. But the weed itself may have had no effect on the crop
yield; unknown to us, associated variation of yield may be due to soil
acidity with which growth of the weed is also correlated. Assuming
variable acidity to have been a prior soil character independent of
treatments, comparison of yields adjusted by the observed regression
to a common weed intensity is valid to eliminate from comparisons
some of the variability due to acidity and any other incidental circumstances
with which weed intensity may be correlated. Control of the
weed by cultivation or herbicide will not produce alteration of yield as
stated by the regression. Without good ancillary knowledge of causative
influences one should not assume that the character measured as x is
responsible for correlated variation of y. A main purpose of this paper
is to demonstrate that such assumptions may have resulted in many
false interpretations.

Although examples given here are mostly from agriculture the 
arguments are general. For example in place of plant density an 
industrial chemist may read impurities in a reagent or still and make 
obvious transpositions of relevant questions. Was variable contamina- 
tion accidental? Did it vary between treatments due to different 
sources of supply? Will it always be associated with respective treat- 
ments, or can it and will it be controlled in future? And so on. 

A standard example is the effect of plant density on yield of field 
plots. It may vary between plots due to careless work by the planters, 
due to action of a drill being faulty or to its seeding rate varying with 
weight of seed in the hopper, due to variable germination associated 
with soil effects, or due to attacks by vermin. All of these should be 
randomly associated with treatments, error and treatment mean squares 
of x should not differ significantly, and the model is directly applicable.


3. DIFFERENCES OF CONCOMITANT VARIABLE 
IMPOSED WITH TREATMENTS 

If treatments are varieties with different seed sizes, sowing all with 
a common drill setting may result in different densities per variety. If 
comparison at common density is appropriate, we could, and perhaps 
should, try to achieve this by adjusting the drill settings. But whether 
or not this is attempted, there will always be some variation in the 
number of seeds actually planted. If one knows that variable plant 
density within varieties derives from seed numbers planted, one may 
feel confident that the regression estimates the effect of alterations of 
seed rate and can be used to estimate what would have been the yields 
of the varieties at seed rates different from those actually sown, in 
particular at equal numbers of seed for all. Now suppose that plant 
number, in addition to being affected by number of seeds planted, is 
also affected by variable germination associated with soil fertility. 
In this case the covariance adjustment may be misleading. Fertility 
has a direct effect on yield quite independent of plant number, whereas 
plant number by itself may not, in fact often does not, affect yield per 
area at all. Thus if the number of seeds sown per variety could be 
controlled exactly, the regression of yield on plant number would in 
reality be a regression of yield on fertility, and adjustment to a common 
plant number would be equivalent to adjustment to a common 
level of fertility. But in practice seed numbers cannot be controlled
exactly, and when they differ enough to affect plant number significantly,
a covariance adjustment to a common plant number is in general an 
adjustment to different levels of fertility—quite contrary to what is 
intended. This unhappy situation arises because for any given plant 
number, there are many combinations of seed number and fertility 
which would produce that particular plant number, and it would be 
impossible to associate each level of fertility with a single plant number. 
If one wants to use an observed regression to allow for an ancillary 
environmental effect which cannot conveniently be randomized with 
treatments and may vary systematically from one treatment to another, 
the only safe procedure is to arrange for it to be deliberately varied. 
Even when one is confident that a regression is attributable directly to 
the observed concomitant variable, unaffected by an intermediate 
correlate, if its variation within treatments be left to chance, it may 
happen that internal variation is too low either to evaluate the regression 
accurately or to overlap between treatments; the error variance of 
consequent extrapolated, or near extrapolated, adjustments may then 
be excessively high. The ideal arrangement, if an ancillary factor may 
seriously affect results, is to factorize it with treatments even if its 
levels may not be precisely controlled. For example Smith [1939] 
described a spacing experiment with wheat varieties where the field 
workers failed to plant the same number of seeds in the replicates of 
each treatment. There was no doubt about the reason for unequal 
plant numbers between replicates and therefore no doubt that the 
within and between spacing regressions would be evaluating the same 
thing. Responses were therefore evaluated as a regression of yield on 
observed plant numbers irrespective of the nominal spacings (actually 
several regressions were evaluated, one for each variety). But if spacing 
replicates had been uniformly sown and variation of plant numbers 
developed owing to field conditions, the internal and external regressions 
might, probably would, have been different. Although nominally both 
would be for yield on plant number they would be used differently, 
the internal being used to adjust in the usual way for variable soil 
conditions of unknown origin, and then the regression on imposed 
variation of plant density evaluated from the adjusted means. 


4. CONCOMITANT VARIABLE AFFECTED BY TREATMENTS 


The next situation to be considered arises when differences of x 
between treatments derive, not from imposition with the treatments, 
but as a consequence of treatments. Varieties, fertilizers, insecticides, 
etc., may affect germination and hence plant density. With foresight, 
after preliminary tests, one might have counteracted with different
seed rates. If such action be reasonable, then evaluation of likely
results had we done so may be in order; but it may be wise to remember
that now the differential germination is a part of the treatment effect,
and to consider whether such counteracting action would in fact be
taken in practice. Should the treatments be more appropriately
compared at equal or at different seed rates? All too often writers
blithely assume, without discussion of what "adjusted" means really
estimate, that they are "more correct" estimates of treatment effects.
More cautious writers seldom go beyond the following remark (Saunders 
and Rayner [1951] p. 123) appended to an example with unequally 
germinating varieties of corn: “If variations in stand form an integral 
portion of the treatment effect, the covariance analysis should not be 
applied except in so far as it may be desired to assess the influence of 
treatments apart from any differences in stand which they may cause.” 
But even this reservation may not be enough. If varieties were planted 
with equal seed numbers, what caused the variation among replicates, 
and can the consequent regression be assumed to represent what would 
happen if the poorer germinating varieties were more densely planted? 
Or did poor germination of some varieties derive from using old seed; 
and if the counteracting action is to use fresh seed, with greater growing 
as well as germinating vigor, would the response be anything like that 
of the internal regression of the given experiment? Suppose in the first 
example above that fertilizer or spray treatments had affected the 
quantity of weed, the interpretation put on adjusted yields is too often 
equivalent to saying that they represent treatment responses to be 
anticipated if weeds were equalized by cultivation: a possible inter- 
pretation but probably false. When the influences causing an observed 
regression are unknown, estimates obtained by projecting along it,
perhaps to an x which is incompatible with a given treatment, have
little if any meaning. Frequently they cancel out effects which an
experiment was intended to detect. 

Snedecor ([1946], Table 12.13) quotes an experiment in which fertilizers
affected both germination and yield of beets. The observed
regressions for yield per beet in arbitrary units on plant number per
plot were:


Between fertilizers    (6 d.f.)    b =   .47 ± .02
Between replications   (5 d.f.)    b = -1.16 ± .43
Remainder             (30 d.f.)    b =   .25 ± .11


For a change in plant number only, without associated fertility effects, 
such a regression should be negative, i.e., increased number of plants 
per acre results in smaller yield per plant. The remainder regression in
this experiment seems to be a fertility effect and using it to estimate
the effect of equalizing stands by altered seed rates would adjust in the 
wrong direction and be worse than useless. Only rarely will there be so 
plain a warning that an observed regression does not represent what one 
might automatically suppose; the hazard for estimating effects of 
modified treatments must be nearly always present and yet has been 
almost universally ignored even when, as here, contradiction of a 
known effect raises a red flag. 

In this experiment the observed treatment and error mean squares
for plant number (x) and the observed and reduced mean squares for
yield per acre (y) were


                        Observed m.s.        Reduced m.s.
              D.f.        x         y             y

Treatments      6       19337    18.809         .411
Error          30         956      .774         .241


The customary inference is to admit that adjusted means have had some 
treatment effect eliminated from them, but to say that since the reduced 
treatment mean square is now insignificant we have obtained the 
"valuable"* additional information that the treatments have produced
nearly all their effect by way of effects on plant number. But this does 
not follow. Without direct experimental variation of plant numbers 
we cannot confidently know what effect on yield per area would have 
been caused by altering them alone. Increasing them would almost 
certainly depress yield per beet and wide variation around customary 
seeding rates usually has little effect on yield per area. The most 
that can be allowed for effect of plant number on yield per acre is to 
increase it proportionally when the regression for yield per beet is 
zero. Such response could be expected only when plant densities are 
so low that plants do not compete with each other, and even if that 
unlikely condition existed in this experiment it could account for less 
than half of the observed response to fertilizers. 


5. DEPENDENT VARIATE AND CONCOMITANT VARIABLE 
BOTH CHARACTERS OF THE EXPERIMENTAL MATERIAL 
Consider finally the situation when x and y are both measurements
of the experimental material. For example, let treatments be varieties
of corn, and x the number of ears at constant plant density, a precursor
of yield. Ear number, an innate variety characteristic, cannot be
altered at will. Comparison of yields adjusted to equal ear number
is then artificial because we cannot in reality produce equal ear numbers
under conditions in which yields would ordinarily be comparable.
We might be able to alter ear numbers by changing plant densities or
fertilizers, but the comparison adjusted to equal ear numbers would
then be a comparison for different, perhaps violently different, environmental
conditions.

*Not a quotation from any particular paper; the adjective commonly appears in the context
indicated.

For the sake of argument suppose that for some reason one is inter- 
ested in such comparison, say for ear numbers equalized by different 
densities of stand. If interpretation is to be unambiguous the inference. 
must, as above, be based on a regression which truly reflects the in-- 
tended mode of variation. Suppose observed plots vary substantially 
in both fertility and plant number, but that the plant numbers (p) 
depend only on seed rates which were randomized with plots. Since 
fertility affects both yield (y) and ear number (e) the regression of y 
on eis a mixture of fertility and plant number effects, and will not measure 
the effect of changing e by varying p. Regression on p should not 
however be affected by fertilities with which it has been randomized. 
Assume that all regressions are linear, and for simplicity consider the 
contrast of just two varieties (A, B) with observed means 7%, , etc. 
From the regression of e on p we may estimate that varieties would 
have equal e when py, — ps = (@4 — és)/b,, . And from the regression 
of y on :p estimate that if compared at p, and pz the difference of yields 
would be 


Ga Gs = Va (é4 é,)b,,/b.> 


That is, to answer the question as to whether yields would be equalized 
by altering p to force equal e we should perhaps compare the ratio 
(ys — Ys)/(€a — €g), With b,,/b., , rather than with the observed b,, 
within treatments. (Compare the use of “instrumental variables.” 
A recent review of the topic is given by Durbin [1954].) Distribution 
problems with ratios of regression coefficients may be difficult, and 
field experiments would rarely be sufficiently accurate to yield in- 
formation on such refined comparisons. We are concerned only to 
point out that adjustment of one variate for a mutually dependent 
one by means of a regression derived from variation of uncertain origin 
leads to estimates without tangible interpretation. Many workers,
warned by Bartlett [1936], are aware of the difficulties; but others who
present such statistics rarely show recognition of having estimated 
something completely different from an equalization of extraneous 
variation. 
Two papers written in the first flush of enthusiasm for a new
technique and reporting such adjustments (Garner et al. [1934]; Brady
[1935]) have been much quoted as model examples of how covariance 
analysis may aid and extend interpretation of experimental data. 
Comparison of regressions may be informative (cf. sec. 7-10), but 
many hours endeavoring to see why the adjusted means of these papers 
are supposed to be helpful invariably end (for me) in a sense of frustra- 
tion and thinking in circles! When treatments induce simultaneous 
variation in all characters to estimate their effects on one while “holding 
another constant” is artificially fictitious. To my mind such estimates 
are anything but illuminating. The commonly occurring phrase 
“corrected means,” with its implication that more correct comparisons 
are thereby made, is misleading; “fictitious means” might be less
deceptive. In absence of experimental control to demonstrate what in
fact would happen if a given character were “held constant,” one can
seldom demonstrate that actually false conclusions are reached, but
treatment means “adjusted” by error regressions seem to yield no
information which cannot be gleaned from simpler statistics unconfused 
by distortions whose interpretation is dubious. 

To illustrate let us re-examine Brady’s example which is still quoted
in standard expositions (e.g. Wishart [1950]). He was studying thickness
of sclerenchyma cell wall (y) in oats as a possible index of resistance
to lodging, and considered its relation to length ($x_1$) and diameter ($x_2$)
of an internode. He grew three varieties, which differed markedly in
lodging potential, at three spacings in a 9 × 9 latin square. His analyses
of variance, crude and reduced, and variety means, were as in Table 1.

TABLE 1
MEAN SQUARES AND MEANS FOR BRADY’S EXPERIMENT

                          Mean squares (y)
                 D.f.       Observed            Reduced
Variety            2         11.0491             1.1309
Spacing            2          1.0465              .2402
Interaction        4           .0912              .1839
Error                  (56)    .3482      (54)    .3135

                          Variety means
                         Observed              “Adjusted”
                 $x_1$     $x_2$      y             y
Sonas            9.13     5.59     5.12          4.79
Victory II      10.56     5.01     4.71          4.72
Sandy           13.45     5.28     3.87          4.19

His conclusions were:

“In comparing the corrected means, the significant value of z
already obtained [for the reduced variety mean square] is undoubtedly
due to Sandy being lower than Sonas or Victory II, the difference
between the latter two varieties being quite insignificant. It is
noteworthy that according to the conclusions following [the analysis of
variance of observed y] Sonas was adjudged as having significantly
thicker sclerenchyma cell walls than Victory II. It is evident that
most of the original mean difference between these varieties is due to
the fact that the Sonas plants had shorter internodes and greater
internode diameters than the Victory II plants, and that these characters
are respectively negatively and positively correlated with thickness
of cell wall. It is now possible to make an important deduction, namely,
that the conclusions arrived at from the analysis [of variance of
observed y] have no general significance. It is known that length of
internode is easily varied by changes in soil fertility. Consequently, on
account of the correlation between length of internode and thickness
of cell wall, it follows that the latter would also be subject to variation
as a result of soil heterogeneity. With different soil conditions it might
be possible to obtain results of an entirely different order from [the
observed means]. Hence, thickness of cell wall cannot be taken as
providing an absolute index of the lodging resistant potentialities of a
variety.”

The heart of this conclusion lies in one word, “absolute,” which 
would be easily overlooked in hasty reading. The author was merely 
concluding that the sclerenchyma observed on one variety at any time 
and place would not be a certain guide for grading it relative to another 
variety observed in different conditions. This is a reasonable conclusion, 
and is only what we should expect. Constancy between varieties is 
possible for some qualitative characters such as petal number and 
color, but is rare and unlikely for quantitative characters. Even the 
difference between Mendel’s giant and dwarf peas could be masked by 
growing the one in impoverished, the other in fertile soil. If sclerenchyma 
might show an equally clear cut genetic effect, only direct experimenta- 
tion in different conditions will prove how much environmental variation 
it can stand before varietal differences would be masked. The regression 
argument provides only circumstantial evidence. Firstly it has not 
been demonstrated how much change of $x_1$ and $x_2$ can be produced by
varying environment, and secondly the regression projections are
unreliable as proof of what would happen to y when they are thus
varied. The reduced mean square can be misleading as an indicator
of what difference in y might still exist when variety-environment
combinations were arranged to equalize the x’s. It has been literally
reduced to be comparable to an error term excluding errors of
adjustments. Suppose the x’s were equalized by the same influences as
operated in the observed latin square. The mean square for variety means
adjusted by the observed regression would be 2.949 whereas the reduced
mean square was only 1.131. For Sonas-Victory the observed
sclerenchyma difference was .41 ± .161 microns; the “adjusted” difference
was .084 ± .204. But the adjustment alone (−.32) has a standard
error of ±.136 so that with 95 per cent confidence we do not really
know but what the correct adjustment might have been only −.05.
Corresponding figures for Sonas-Sandy are observed difference 1.25 ±
.161; “adjusted” difference .617 ± .275; adjustment −.63 ± .229 with
confidence only that it would probably be bigger than −.17. These
estimates depend furthermore on the postulate that the appropriate 
regression has been estimated, a postulate which has been noted above 
to be often rather shaky. Hence, such projections are not conducive 
to confident evaluation of what would really happen. 
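
The arithmetic behind these limits can be checked with a short sketch. It assumes 27 plots per variety mean (81 plots of the latin square shared among three varieties, an inference from the design as described) and takes 2 as a round approximation to the 95 per cent t multiplier; the variable names are illustrative only.

```python
import math

# Error mean square of observed y, as given in Table 1.
error_ms = 0.3482
# Assumption: 81 plots / 3 varieties = 27 plots behind each variety mean.
plots_per_variety = 27

# Standard error of a difference between two observed variety means.
se_diff = math.sqrt(2 * error_ms / plots_per_variety)

# Adjustments and their standard errors as quoted in the text.
adjustments = {
    "Sonas-Victory II": (-0.32, 0.136),
    "Sonas-Sandy": (-0.63, 0.229),
}

# Assumption: 2 standard errors as an approximate 95 per cent limit.
limits = {pair: adj + 2 * se for pair, (adj, se) in adjustments.items()}

print(f"s.e. of an observed difference: {se_diff:.3f}")
for pair, limit in limits.items():
    print(f"{pair}: adjustment might be only {limit:.2f}")
```

The limits come out close to −.05 and −.17, which suggests that the quoted confidence statements rest on roughly two standard errors of the adjustment.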

Anyhow regression arguments among mutually variable characters 
are inappropriate. The speculation is that the internode characters 
which are easily observable may provide as good evidence of lodging 
potential as internal characters which might not then be worth using 
owing to the difficulty of observing them. The speculation is reasonable, 
but evidence for or against it should derive from a regression analysis 
of lodging as a variate on all potential predictors studied on equal 
footing as independent variables. Alternatively, in absence of scaling 
for lodging resistance, one might consider a canonical variate or principal 
component analysis for discriminating between varieties. One could, 
of course, state that two very highly correlated characters (or sets of 
characters) would give little better discrimination than either one alone. 
But the multiple correlation of sclerenchyma with both internode
characters is only $R^2 = .132$ in the remainder row of the covariance
analysis with 56 degrees of freedom (P = .02), or $R^2 = .195$ for total
variation within varieties with 78 degrees of freedom including spacing
effects. (With only three varieties the between variety correlation of 
course cannot be evaluated.) These figures therefore do not support 
the conclusion that sclerenchyma may add little or nothing to dis- 
crimination. 
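
As a rough check on the quoted significance level, the sketch below converts $R^2$ into a variance ratio. It assumes two fitted regressors and 54 residual degrees of freedom (the 56 of the remainder row less 2), and uses the closed form of the F tail area that holds when the numerator has 2 degrees of freedom; the function name is hypothetical.

```python
def f_and_p_two_predictors(r_squared, resid_df):
    """Variance ratio and upper-tail probability for R^2 with two predictors.

    With 2 numerator degrees of freedom the F tail area has the closed
    form (1 + 2F/nu)^(-nu/2), so no statistical tables are needed.
    """
    f_ratio = (r_squared / 2) / ((1 - r_squared) / resid_df)
    p_value = (1 + 2 * f_ratio / resid_df) ** (-resid_df / 2)
    return f_ratio, p_value

# Assumption: 54 residual d.f. after fitting both internode characters.
f_ratio, p_value = f_and_p_two_predictors(0.132, 54)
print(f"F = {f_ratio:.2f}, P = {p_value:.3f}")
```

Under these assumptions P comes out near .02, in line with the value quoted for the remainder row.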

Conclusions adduced from the covariance analysis of Brady’s 
experiment contain two phrases which commonly occur in reports of 
correlation studies. “Most of the original mean difference [of
sclerenchyma] between these varieties is due to the fact that” they had
different internode lengths and diameters; and “differences between the
variety means for thickness of cell wall were exaggerated by reason of
this correlation” (Wishart [1950]). Apart from the biological
correlations being low with causative connections unknown, which
circumstances make the statement even more dubious, such inferences seem
analogous to saying that the difference between observed heights of
Mt. Everest and Pike’s Peak is “due to” air density and is “exaggerated,”
implying that it is false, because the difference adjusted for correlation 
of altitude with atmospheric density would be negligible! Obviously 
such conclusions are incorrectly, or at best metaphorically, phrased; 
but even after recognizing that, the intended meaning always seems to 
remain rather puzzling. One may speculate on the consequences for 
something else—relative difficulties of climbing or assessment of lodging 
potential—if circumstances promoting the correlation were altered and 
how. The speculation may be idle curiosity—suppose the summit of 
Everest were at 10,000 feet; or it may be about something of practical 
consequence—could Everest be climbed as easily as Pike’s Peak after 
inventing oxygen and pressure suits? The latter type of suggestion, 
especially when derived from low correlations of unspecified origin 
should be recognized to be, not a conclusion, but a speculative hypothesis 
until it can be verified or modified by demonstrating what in fact are 
the consequences after the environment has been
altered in the specified manner.


6. A PHYSICAL-CHEMICAL EXAMPLE 


On rubber plantations hydrometric observations, that is measure- 
ments of density, are commonly used to indicate dry rubber content of 
latex. In an experiment on the effect of some treatments on a physical 
or chemical character of latex it might be used as a concomitant variable. 
It may serve well to control effects of variability in both d.r.c. and serum 
of latices which have been randomized with treatments. But it will 
not serve as a control for bringing latex from different sources to equal 
d.r.c. by adding water or ammonia solution; or to estimate results at 
equal d.r.c. when the treatments impose different degrees of dilution. 
The imposed variation between treatments would then be on a curve, 
depending on density of the diluting fluid, which is easily shown (Smith
[1940]) to be different from that for the relation between d.r.c. and
density of latex as produced by the trees. Furthermore chemical and
physical characters are often influenced by serum solids, and whereas a
lower d.r.c. with increasing density of field latex implies more serum
solids, to increase density by artificial dilution would reduce these in
proportion to rubber content. Hence if, unknown to the experimenter, 
variability of the observed character happens to depend more on the 
serum solids than on rubber content, adjustments on density as it 
varies between latices could be in the wrong direction. Similar remarks 
apply if the concomitant control were direct observation of rubber 
content but the variate happens to be affected by serum solids and 
varying degrees of dilution are introduced with treatments. Within 
treatments variation derived from different latices would show negative 
correlation of rubber content and serum solids, whereas between treat- 
ments they would be positively correlated and adjustments made on 
the assumption that rubber content was the sole cause of an observed 
regression would be false. 


7. COMPARING TREATMENT AND ERROR REGRESSIONS 


Computationally the partition of sum of squares for treatment 
regression and mean error regression, Y; and Y; , into the alternative 
parts Yj and Y} is exactly the same as the analysis of variance for 
regression heterogeneity. But there is no point to considering $Y_2$ as a
regression contrast ($b_T - b_E$) unless the between treatment regression
has some meaning, which in turn usually implies that we can anticipate
or postulate some association between $\tau_i$ and $x_{i\cdot}$ which may be worth
estimating. Regression of treatment (or class) responses on x can
have meaning only when x has some connection with treatments,
either is influenced by them or is a characteristic of them, so that a
causal or correlational association of y with x, dependent on treatments
may be anticipated. Perhaps the simplest example occurs when treat- 
ments are qualitatively determined but have a measurable ingredient 
variable within them. Two questions may be raised. Can that in- 
gredient be considered responsible for all treatment responses? If it 
may not account for all responses, is the between treatment response to 
it the same as that occurring within treatments or is it modified by 
interaction or by confounding with other ingredients? Perhaps most 
cases where the external regression has meaning as measure of a quan- 
titative response may formally be reduced by analogy to that form, 
even although the independent variable is not in ordinary parlance 
an ingredient. 

Boundary cases are worth noting for perspective. When treatments 
are quantitative—levels of a fertilizer, temperature, concentrations— 
the responses are efficiently summarized by a regression on the levels, 
obviously a “between treatment” regression on an “ingredient” which 
happens here to be the total stimulus applied and to be constant for 
replications at each level. We could formally analyze the experiment
as an analysis of covariance, but since there is no variation of x within
treatments all sums of squares of, and of products with, x would be
zero except between treatments, and no internal regression can be eval- 
uated. Hence it would seem artificial so to set out the computations, and 
without such ado the linear, quadratic, etc., terms of the regression are 
broken out as special contrasts between treatments. 

The ingredient x might be an impurity whose effect we are concerned
to “eliminate.” There are then two boundary cases. (1) The quantity
of impurity may not vary within treatments, and as in the preceding
paragraph there is no internal regression. (1a) The treatments proper
are qualitative and we can postulate that responses to them will be at 
random relative to quantities of impurity. The effect of impurities 
can then be estimated from the external regression and treatment 
responses by deviations from it. (Notice two incidental consequences. 
The standard error of the regression on x (which one may wish to
examine even if only incidentally) will depend on variance of treatment
means about it, not merely on the internal experimental error. The 
treatment mean square will have only (t − 2) degrees of freedom,
corresponding to $Y_3 \cdots Y_t$, and magnitudes of individual responses
as measured by deviations from the regression will be proportionately 
biased.) (1b) Treatments proper are quantitative. Separation will 
then be by ordinary regression with the treatment means as variates. 
(2) The impurity is variable within treatments and can be assumed 
independent of treatment responses. This returns us to the typical 
analysis of covariance for elimination of extraneous variation by adjust- 
ing treatment yields on the internal regression. Although the extraneous 
variation is in the treatments themselves, instead of as usual in the 
experimental units, the consequences are obviously the same. 

The following sections give some examples where the external 
regression may be of interest. 


8. AN EXAMPLE ILLUSTRATING THE TEST FOR INTERNAL VERSUS 
EXTERNAL REGRESSIONS 


Correlated plant characters in a variety trial or genetic experiment 
provide an example of internal and external relationships which may 
be different. Between plots of one variety, strain or clone, (assumed 
genetically homogeneous), variation is determined by reactions to 
environmental conditions (fertility, water, etc.); between varieties 
it is determined by genetic factors. Table 2A presents an analysis of 
variance and covariance for logarithms of ear number per plant (x) and
of number of grains per ear (y) observed in quintuplicate plots of 9
wheat varieties. (For simplicity some unsatisfactory blocking restric- 
tions are ignored.) Since the environmental variation is of course not 
entirely averaged out of strain means the estimates of genetic variance 
components, and by analogy of the covariance component, are obtained 
as in Table 2B. These suggest a genetic correlation r = −.94. Should
we now assume a functional genetic relation or merely a correlation? 
If the latter, can a component regression usefully describe the rela- 
tionship or should some more central line—such as the principal com- 
ponent—be used? Anyway it is at once obvious that the estimate of 8,, 


TABLE 2A

ANALYSIS OF COVARIANCE FOR EAR NUMBERS PER PLANT (x) AND GRAIN NUMBERS
PER EAR (y) OF 9 WHEAT VARIETIES IN 5 REPLICATIONS

                              Sums of squares and products
Source of                        (for logarithms × 10³)
variation            D.f.      y²          yx          x²          b
Varieties              8     16,329     −16,262      23,724     −.6855
Within varieties      36      5,971         920      14,911      .0617

Source of variation                   D.f.      S.s.       M.s.
Variety regression                      1      11,147
Av. env. regression                     1
Deviations from var. regression         7       5,182      740.3
Deviations from env. regression        35                  169.0


TABLE 2B

ESTIMATION OF VARIANCE AND COVARIANCE COMPONENTS

                              Mean squares and products
Source of variation            y²          yx          x²          b
Varieties                    2041.1     −2032.8      2965.5
Within varieties              165.9        25.5       414.2
Difference = variance
  and covariance
  components × 5             1875.3     −2058.3      2551.3     −.8068

from ($b_T - b_E$) as described in section 2 will not be the same as a
regression from the variance and covariance components. Even if a regression
approach to the problem may be useful, models evidently need redefining
with some care. Very little work has been done on regression with
components (for some suggestions see Tukey [1951]): it is an application
of analysis of covariance which extends into multivariate analysis
and therefore goes beyond elementary regression formulation and the
field of this review.
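
The component estimates of Table 2B follow mechanically from Table 2A, as the sketch below illustrates. It takes the printed sums of squares and products as given and assumes 5 replicate plots per variety; the dictionary keys and helper name are illustrative, not notation from the paper.

```python
import math

# Table 2A: sums of squares and products with their degrees of freedom.
varieties = {"yy": 16329.0, "yx": -16262.0, "xx": 23724.0, "df": 8}
within = {"yy": 5971.0, "yx": 920.0, "xx": 14911.0, "df": 36}

def mean_squares(line):
    """Divide each sum of squares or products by the line's d.f."""
    return {key: line[key] / line["df"] for key in ("yy", "yx", "xx")}

ms_varieties = mean_squares(varieties)
ms_within = mean_squares(within)

# Variety minus within mean squares estimate 5 x the genetic components
# (5 being the number of replicate plots per variety).
components_x5 = {key: ms_varieties[key] - ms_within[key] for key in ms_varieties}

# The common factor 5 cancels in both the correlation and the regression.
genetic_r = components_x5["yx"] / math.sqrt(components_x5["yy"] * components_x5["xx"])
component_b = components_x5["yx"] / components_x5["xx"]
variety_b = varieties["yx"] / varieties["xx"]

print(f"genetic correlation = {genetic_r:.3f}")
print(f"regression from components = {component_b:.4f}")
print(f"regression from variety means = {variety_b:.4f}")
```

The correlation has magnitude .94 and takes its sign from the negative covariance component; the component regression (about −.81) differs visibly from the regression on variety means (about −.69), which is the distinction drawn in the text.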

However to illustrate some other matters, suppose we have a pre- 
diction problem, say to predict grain numbers before maturity until 
when they cannot be observed, and we want to test divergence of 
regressions for means of 5 plots of different varieties and for single 
plots within a variety. The varieties are assumed to be a sample of a
population of varieties, not necessarily a random sample of the whole
population (as the nine here were not, having been selected to represent
varieties of different tillering capacity), but such that each one is a random
representative of a sub-population conditional on the observed ear
number. (That is of course the elementary condition for an unbiased 
estimate of any regression.) 

Now the precision with which a regression can be determined from 
a sample depends on the variance of observations about it. Relative 
to determining a correlation or regression over varieties, the variety 
means are single observations. Their variance about regression can 
be equivalent to experimental (environmental or internal) error only 
if x is a true independent variable free of error and imposes a functional 
relation on the “true” class means. Hence the precision of the between 
variety regression must be estimated from $T^2/(t - 2)$.

Lest there be any doubt about this, consider that a contrast between
just two varieties could be tremendously significant relative to
experimental error, but the existence of a correlation cannot be established
from only two points: it has no degrees of freedom with which to show
how much correlation may be relaxed among other class means.
Although this should be obvious when pointed out it has not always been
recognized. Many fictitious correlations have been reported as if 
from large numbers of observations when actually they derived from 
only 3 or 4 dominating class differences. 

Since the external and internal regressions are uncorrelated (provided
we can assume equal error variance within every class) the
estimate of error variance for their difference is

$$\mathrm{Var}(b_T - b_E) = \frac{T^2}{(t - 2)\,T_{xx}} + \frac{E^2}{(\nu_E - 1)\,E_{xx}}$$

and since its two parts are derived from estimates of variances which
cannot be assumed equal the test statistic

$$d = (b_T - b_E)\big/\sqrt{\mathrm{Var}(b_T - b_E)}$$

must be referred to Sukhatme’s table (Fisher and Yates, Table VI)
or to the t-table with quasi-degrees of freedom as suggested by Welch
[1938], Smith [1936], Satterthwaite [1946] and Cochran [1951]. For
the example

$$\mathrm{Var}(b_T - b_E) = \frac{740.3}{23724} + \frac{169.0}{14911} = .04254$$

and

$$d^2 = .7472^2/.04254 = 13.12$$

which is plainly significant without stopping to compute degrees of
freedom which must be somewhere between 7 and 42.
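
The worked figures can be reproduced from Table 2A with the sketch below. The variable names are illustrative, and the last step uses the Satterthwaite-type approximation for the quasi-degrees of freedom, which is one of the options cited above rather than a calculation shown in the text.

```python
# Table 2A sums of squares and products.
T_yy, T_xy, T_xx, t = 16329.0, -16262.0, 23724.0, 9   # between 9 varieties
E_yy, E_xy, E_xx, df_E = 5971.0, 920.0, 14911.0, 36    # within varieties

# External (between-variety) and internal (within-variety) regressions.
b_T = T_xy / T_xx
b_E = E_xy / E_xx

# Mean squares for deviations about each regression.
v_T = (T_yy - T_xy ** 2 / T_xx) / (t - 2)
v_E = (E_yy - E_xy ** 2 / E_xx) / (df_E - 1)

# Variance of the difference of two uncorrelated regression coefficients.
part_T = v_T / T_xx
part_E = v_E / E_xx
var_diff = part_T + part_E
d_squared = (b_T - b_E) ** 2 / var_diff

# Assumption: Satterthwaite-type approximation to the quasi-degrees of freedom.
quasi_df = var_diff ** 2 / (part_T ** 2 / (t - 2) + part_E ** 2 / (df_E - 1))

print(f"v_T = {v_T:.1f}, v_E = {v_E:.1f}")
print(f"Var(b_T - b_E) = {var_diff:.5f}, d^2 = {d_squared:.2f}")
print(f"approximate degrees of freedom = {quasi_df:.1f}")
```

On this approximation the quasi-degrees of freedom fall near 13, inside the stated bounds of 7 and 42, and $d^2$ remains far beyond any usual significance level.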


Substituting $Y_2^2$ which may have been extracted in the analysis of
variance an equivalent formulation is

$$F'\,(T_{xx} v_E + E_{xx} v_T) = (T_{xx} + E_{xx})\,Y_2^2$$

where $v_T$ and $v_E$ are the mean squares for deviations about external
and internal regressions with (t − 2) and ($\nu_E$ − 1) degrees of freedom
respectively.


9. AN EXAMPLE OF A TREATMENT-INDUCED CONCOMITANT 


This example, from data supplied by Dr. D. D. Mason, illustrates 
occurrence of the concomitant variable as a treatment ingredient in fact 
instead of merely formally. Alfalfa was prepared in three ways—as 
field-cured hay, barn-cured hay and wilted silage; and was fed to six 
blocks of three cows each in a switch-back experiment, i.e., the “plots” 
within blocks are three periods within a three-cow lactation term. Some 
restrictions on randomization for assessment of carry-over effects are 
ignored here for simplicity. The observations were X = mgm. of 
carotene intake in each plot (estimated from analytical samples from 
each ration), and Y = vitamin A potency per pound of butter fat. 
The objectives were stated to be: (1) To determine the overall effect 
of different methods of curing alfalfa on the vitamin A potency of 
milk, and (2) To determine the effect of the methods of curing inde- 
pendent of the carotene content. The first is answered by the observed 
means and their standard errors. The purpose here is to give critical 
consideration to just how much about the second question can be inferred
from covariance analysis.


Logarithms of the data and the analysis of variance and covariance
are given in Table 3. The negative correlation between blocks (r =
−.203) is noteworthy since it would mask correlation of the variates within
treatments if block effects were not extracted. Although not significant
with so few observations, the correlation is probably real due to the
units observed; it suggests as is reasonable that large butter producers
eat more but not in proportion to production: total intake is increased
but is less per pound of butter.


TABLE 3

CAROTENE INTAKE (X) AND VITAMIN A POTENCY PER POUND OF BUTTER FAT
(Y) FOR THREE TYPES OF ALFALFA FORAGE: $x = 10^3(\log_{10} X - 2)$,
$y = 10^3(\log_{10} Y - 3.8)$

              Field hay          Barn hay            Silage
Block          x       y          x       y          x       y
1             149     136        297     138        781     231
2             164     154        393     174        766     280
3              64     111        299     131        675     219
4              90      96        279     207        659     261
5             218      76        389      66        919     263
6             193      85        360      85        979     272
Total         878     658       2017     801       4779    1526
Mean        146.3   109.7      336.2   133.5      796.5   254.3


ANALYSIS OF VARIANCE AND COVARIANCE

                        Sums of squares & prod.              Regression
                D.f.      x²        xy        y²        b      Square   D.f.  Deviations
Blocks            5      83580   −13437      9886       —
Treatments        2    1341320   308411     72181    .22993    70913      1      1268
Remainder        10      29930    11922     11578    .39833     4749      9      6829
Treatm. + Rem.   12    1371250   320333     83759    .23361    74832     11      8927

                                     D.f.    S.s. or M.s.
Heterogeneity of regression            1         830
Deviations from treatm. regr.          1        1268
Reduced treatm. sum of squares         2        2098
Error mean square                      9       758.8


The analysis of variance looks pleasantly clear cut. Regression 
squares are high, indicating that carotene content of forage is associated 
with vitamin A in butter, and both regression heterogeneity and devia- 
tions from regression squares are similar to the estimate of error variance. 
Clearly there is no evidence against the hypothesis that all observations 
are collinear and that forage effects may derive entirely from their 
carotene contents. This of course is very far from proving that carotene 
is the only substance in the forages affecting vitamin A response. For 
one thing, since all forages are derived from a similar substrate, alfalfa
(presumably all from one field), there may be a correlated substance
which has not been measured. Apart from that, means adjusted on
the internal regression are virtually useless: they involve violent
extrapolation and insignificance of their differences means nothing. For
example the observed difference of y between silage and field hay is
145 ± 19.6; the adjusted difference is −114 ± 105 with almost all its
standard error derived from uncertainty about the regression. The
variance is compounded as follows:

$$\text{variance} = 758.8\,(0.33 + 14.12) = 10970.$$

(Arithmetic data show similar significance tests but different relative
magnitudes. The respective differences are observed 3248 ± 410 mgm,
adjusted 958 ± 890.) If one were seriously concerned to determine
whether the method of curing affects vitamin production other than
through its effect on carotene in the forage the experiment needs to be
supplemented by hays reinforced by extra carotene so that more direct
comparisons can be made. (If the effect of carotene is accurately
known from other sources, a better projection might be made; but with
the need to make allowances for quantities eaten and produced such
theoretical projections for biological material are often doubtful.)
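
The way the standard error of the adjusted difference is dominated by the regression can be seen in a small sketch. It takes the remainder line and the treatment means of Table 3 as printed, with 6 blocks per mean; the variable names are illustrative only.

```python
import math

# Remainder (error) line of Table 3: sums of squares and products.
E_xx, E_xy, E_yy = 29930.0, 11922.0, 11578.0
error_df = 10      # before fitting the regression
blocks = 6         # observations behind each treatment mean

# Treatment means of x and y from Table 3.
mean_x = {"field": 146.3, "silage": 796.5}
mean_y = {"field": 109.7, "silage": 254.3}

b_E = E_xy / E_xx                                      # internal regression
ms_observed = E_yy / error_df                          # error m.s. of observed y
ms_adjusted = (E_yy - E_xy ** 2 / E_xx) / (error_df - 1)

dx = mean_x["silage"] - mean_x["field"]
dy = mean_y["silage"] - mean_y["field"]

# Observed difference and its standard error.
se_observed = math.sqrt(2 * ms_observed / blocks)

# Adjusted difference: the second variance term carries the
# uncertainty of the regression coefficient, scaled by dx squared.
adjusted = dy - b_E * dx
sampling_term = 2 / blocks
regression_term = dx ** 2 / E_xx
var_adjusted = ms_adjusted * (sampling_term + regression_term)
se_adjusted = math.sqrt(var_adjusted)

print(f"observed: {dy:.0f} +/- {se_observed:.1f}")
print(f"adjusted: {adjusted:.0f} +/- {se_adjusted:.0f}")
print(f"variance terms: {sampling_term:.2f} + {regression_term:.2f}")
```

The regression term (about 14.1) is some forty times the sampling term (0.33), which is the numerical sense in which the adjustment is a violent extrapolation.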

However it is not unlikely that when doing an experiment of this
sort one has a fairly confident anticipation that all response will be
related to carotene, that results can be summarized in terms thereof,
and that objective (2) as stated above is less an objective than a check
for any contrary evidence before going on to such summary. The
experiment might even have been done primarily to evaluate the effect
of carotene as “naturally” varied, perhaps for comparison with other
experiments using a “synthetic” form. Clearly the effect may be
better evaluated from between treatments owing to the extended
range, and this may yet be so even if treatments have a superimposed
qualitative effect if we can assume such to occur at random relative
to the mean carotene contents. We will of course check for any dis- 
agreement between internal and external regressions, but since in this 
case the internal is poorly determined with 95 per cent confidence 
interval .038 to .758 the check does not mean much. The main difficulty 
if we allow the possibility of superimposed treatment effects is that 
with only three treatments we have no satisfactory estimate of error 
for the external regression. Technically its significance stands to be
tested by $t = (70913/1268)^{1/2} = 7.62$ with only one degree of freedom
and so needing 12.71 to reach the 5 per cent significance level, but
since we expect an effect approximately as indicated our subjective
confidence is high. There is little however that can be done unless we are
prepared to accept the hypothesis that carotene is the only operative
agent, and that treatments add no extra responses. The situation then
becomes similar to the wheat seed spacing experiment quoted in sec. 3
and the regression can be estimated from total variation within blocks.
This yields the estimate b = .2336 ± .0243. Even allowing that the
standard error may be under-estimated this should be much better
than the internal estimate with error ±.159. (The alternative is to
take the treatment regression, .230 with error ±.028 based on the
reduced treatment mean square with 2 degrees of freedom. Even
with so few degrees of freedom this would still be better than the internal
regression.)
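
The three competing estimates of the regression and their standard errors can be recomputed from the sums in Table 3, as sketched below. The helper name is hypothetical, and the t multiplier 2.262 (two-sided 5 per cent point for 9 degrees of freedom) is taken from standard tables rather than from the paper.

```python
import math

# (x², xy, y²) lines of Table 3.
treatments = (1341320.0, 308411.0, 72181.0)
remainder = (29930.0, 11922.0, 11578.0)
pooled = tuple(a + b for a, b in zip(treatments, remainder))

def slope_and_deviations(line):
    """Regression coefficient and sum of squares of deviations about it."""
    s_xx, s_xy, s_yy = line
    return s_xy / s_xx, s_yy - s_xy ** 2 / s_xx

b_int, dev_int = slope_and_deviations(remainder)     # internal, 9 d.f.
b_pool, dev_pool = slope_and_deviations(pooled)      # within blocks, 11 d.f.
b_treat = treatments[1] / treatments[0]              # external regression

se_int = math.sqrt((dev_int / 9) / remainder[0])
se_pool = math.sqrt((dev_pool / 11) / pooled[0])

# Reduced treatment sum of squares (2 d.f.) as error for the external slope.
reduced_treat_ss = dev_pool - dev_int
se_treat = math.sqrt((reduced_treat_ss / 2) / treatments[0])

# Assumption: tabulated two-sided 5 per cent t value for 9 d.f.
T_975_9DF = 2.262
ci_int = (b_int - T_975_9DF * se_int, b_int + T_975_9DF * se_int)

print(f"internal  b = {b_int:.4f} +/- {se_int:.3f}, 95% limits {ci_int[0]:.3f} to {ci_int[1]:.3f}")
print(f"pooled    b = {b_pool:.4f} +/- {se_pool:.4f}")
print(f"treatment b = {b_treat:.4f} +/- {se_treat:.3f}")
```

Both the pooled and the treatment regressions have standard errors roughly one sixth that of the internal regression, because their sums of squares of x are some 45 times larger.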

Finally one may note that since X can only be estimated from samples 
extracted from the forage actually eaten it must contain error which 
could bias the regression as estimate of true response. In a case like 
this the functional line could be estimated using treatments as “‘instru- 
mental variate” leading in effect to the Wald-Bartlett-Nair-Shrivastava 
method of averages; but this refinement is trivial relative to ignorance 
of the true form of curve which should be fitted. 


10. NOTES ON BIBLIOGRAPHY 


One or two text books earlier, and some half-dozen since 1950*, have
presented expositions of analysis of variance for testing regression
heterogeneity including the external versus internal regression, but
interpretation of the latter is usually either side stepped or at best
controversial. This is perhaps due to absence of good examples where
it is relevant, but may also be attributable to unfortunate examples
used by the originators and leading expositors of this feature of analysis
of covariance. Better examples are still needed. (The ones used
above are not yet good enough!)

*Among texts on general experimental statistics the test for heterogeneity of regressions in
independent samples is given by Villars [1951], Goulden [1952], Tippett [1952]; the break-out of $Y_2$
for $b_T - b_E$ is given by Wishart and Sanders [1936], and Saunders and Rayner [1950]; both are given
by Snedecor [1946], Wishart [1950], Quenouille [1952], Bennett and Franklin [1954]. Of the last two
groups only Quenouille seems to recognize that $Y_2$ is not usually to be tested against internal error.
I am not acquainted with sociology texts.

(1) E. S. Pearson [1934] first formulated the complete breakdown of 
the sum of squares. Its application (with computations by Miss F. E. 
James on data presented by Wilsdon) concerned the regressions of 
strengths of concretes on strengths of mortars made from the same 
cements. Unresolved difficulties in both design and analysis may 
now be recognized. 

Welch [1935] simultaneously used likelihood ratio methods to 
derive the analysis of tests for collinearity, and illustrated on the same 
data. 

(2) The contrast of external versus internal regression was next 
presented by Wishart and Sanders [1936] with a clarity of algebraic 
exposition which probably cannot be improved. Their two examples 
(a theoretical one represented by their figure 2, and one on live data) 
consider variates which are both affected by treatments; comparison 
of the two regressions may therefore be relevant. But the discussion 
purports to represent ordinary covariance usage to eliminate from 
estimated treatment means some of the random variation associated 
with an extraneous variable. It shows no recognition of the circumstance
that when covariance is used in this way the regression of treatment
means on x is irrelevant, and conversely that when both variates are
affected by treatments and it is their relationship which matters the 
adjusted means are artifacts. 

(3) Kendall ([1955] sec. 24.30) gives what is perhaps the best 
available algebraic exposition of the complete regression heterogeneity 
analysis of hierarchical classification, closely following Pearson [1934] 
but with a simpler and more straightforward example. (Any edition 
may be used for the mathematical exposition but the 1955 impression 
should be consulted for interpretation. A mischievous Puck seems to 
have played with the earlier text to mate systematic and random 
effects and accidentally invert ratios—mischances obvious to experts 
but worrying to students.) However the following ambiguities, more 
or less latent in Pearson’s paper which is being followed, still remain. 

Kendall follows the practice of those who take a likelihood ratio 
approach to multivariate tests for heterogeneity (as illustrated by 
Welch [1935] and Pearson and Wilks [1933]) whereby one first does a 
pooled test and then subdivides if required. This raises the thorny 
problem “‘to pool or not to pool?” and gets everyone a little confused. 
For example the theoretical exposition in effect says, if the pooled test 
is insignificant consider next the details; whereas the sample says, the 
pooled test is significant so go on. This seems typical of the progression 
in multivariate tests! It seems to me one were better to decide in 
advance the points at which a null hypothesis may be expected to 
fail, go directly there, and stand by the result. For example, if the 
analysis of covariance is being done for control of extraneous variability 
the external versus internal regression heterogeneity is irrelevant, and 
heterogeneity of internal regressions is best left in as part of random 
error. If the experiment has been duly randomized in design, so have 
the consequences of any regression heterogeneity and attempts to take 
it out get one into supplementary difficulties: to consider regression 
differences is to go on to something the experiment was not designed to 
explore and for which it is unlikely to be efficient. 

Secondly, Kendall implies that the comparative efficiency of x as 
a predictor for another character in different groups of individuals can 
be judged from the magnitudes of the regression coefficients in each 
group. But efficiency of prediction is independent of slope of regression. 
It is determined by the precision of prediction—that is in general by 
variance of the predictand about the regression. (For special cases 
one may consider also precision with which regressions are estimated 
from a given size of sample, which depends on dispersion of x, as well 
as on variance about regression, but is still independent of slope.) 

Finally Kendall states that S,(Y? in this paper) should be compared 
to internal error, whereas we have seen that either it is irrelevant, or 
that when it has meaning the error of by , and hence also of Y, , depends 
on deviations about the external regression. 

Models: No work seems to have been done on constructing mathe- 
matical models for situations where a between treatment regression 
may be of interest and different from an internal regression. The 
task seems to be less simple than might be anticipated. A suitable 
form may need to be derived for each case individually with attention 
to the physical or biological circumstances. 

Unpleasant complications can arise in ill-planned experiments. For 
example in the urea clearance work discussed by Smith [1951] men were 
fed supplementary urea quite erratically; and long term changes to re- 
duced protein diets were confounded with renal defect. Consequently 
variation between men introduced almost inextricable confusion of re- 
sponses corresponding to natural differences between men, to a day or 
two of supplementary urea in diet, to long term diet effects, and to 
ordinary day-to-day fluctuations. 

If an independent variable, x, were controllable its variation could 
be imposed as a cross treatment. Comparisons of external and internal 
regressions seem to become relevant only when x varies within treat- 
ments essentially as a random variable and must be so regarded. This 
makes complications for evaluating expectations of functions of obser- 
vations and perhaps only criteria of consistency can be handled in 
elementary manner. Some tentative endeavors to construct models 
for the foregoing examples seem as yet too crude to be worth presenting 
at this time. 
THE ANALYSIS OF COVARIANCE FOR 
INCOMPLETE BLOCK DESIGNS* 


MARVIN ZELEN 


National Bureau of Standards, Washington, D.C., U.S.A. 


1. INTRODUCTION 


In any experiment the careful experimenter is always trying to 
“control” all factors which influence the final results of the experiment. 
In many experiments, however, it is not practical to control all factors 
even though these factors can be measured quantitatively. For example, 
in calibrating meter-bars, the temperature of the chamber where the 
measurements are taken cannot be held constant for repeat measure- 
ments, thus necessitating a temperature correction to the measurement. 
In other experimental situations the innate capacity of the experimental 
units to respond to the treatments applied to them, is so highly variable 
that small or even moderate differential treatment responses may be 
completely obscured. This is the case (say) in applying treatments to 
used storage batteries where the condition of every cell is highly variable 
in its ability to respond to a charge. In still other experiments a drift in 
the measuring equipment is present which may not have been taken 
into account in the experimental design. 

All of the above experimental problems can be handled most effec- 
tively by the analysis of covariance technique which was first introduced 
by R. A. Fisher [13, 5th edition] in 1934. Thus, in the meter-bar 
measurements, the corrections applied can be made to depend only on 
the results of the particular measurements themselves, rather than on 
temperature corrections obtained independently.** The sensitivity of 
the battery experiment can be markedly improved by applying a 


*Paper presented at the joint meeting of the Biometric Society and the American Statistical 
Association, Detroit, Michigan, September 1956. 

**In practice the range of temperatures involved is so minute that the correction so obtained 
would itself be very imprecise, and recourse is taken to corrections derived from special determina- 
tions involving a wide range of temperatures. In those cases where the temperature range is relatively 
large or when there is a lag between the temperature of the meter bars and that of the chamber, the 
analysis of covariance can be used to make the adjustment for temperature. 
preliminary small charge to each cell before any treatments are applied, 
then discharging each cell and using this preliminary discharge data to 
adjust the final test results by taking into account the initial ability of 
each cell to respond to a charge. The effects of drift in an instrument 
can, in many cases, be approximated by a polynomial expression of 
time, thus allowing measurements to be “corrected” for drift. 

Covariance analysis can also be used for analyzing experiments 
when through unforeseen circumstances several observations are 
missing. If only one observation is missing, or if a very simple experi- 
mental design is used, the analysis may not be unduly complicated; 
however for the more complicated designs the absence of measurements 
presents a major problem in the analysis. For these cases, covariance 
analysis can be readily applied, resulting in a relatively simple analysis. 
The procedure first suggested by Bartlett [3] is to let the value of the 
missing measurements be zero, and introduce concomitant variates 
corresponding to each missing observation such that the value of the 
concomitant variate is +1 for the missing unit and zero elsewhere. 
The number of concomitant variates will be equal to the number of 
missing experimental units. 
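
As a minimal sketch of the device just described, the following Python function treats a randomized block layout with a single missing unit. The layout, the function name `bartlett_missing_value`, and the data are illustrative assumptions and are not taken from the paper. The missing response is entered as zero, a concomitant variate equal to +1 on that unit and zero elsewhere is introduced, and the negative of the error regression coefficient is returned as the estimate of the missing value.

```python
def bartlett_missing_value(table, missing):
    """Estimate one missing cell of a treatments-by-blocks table.

    table[i][j] is the response for treatment i in block j; the entry at
    `missing` = (i, j) is ignored and treated as zero, as in the text.
    """
    t, b = len(table), len(table[0])
    y = [[float(value) for value in row] for row in table]
    y[missing[0]][missing[1]] = 0.0          # missing response set to zero
    # Concomitant variate: +1 on the missing unit, zero elsewhere.
    x = [[1.0 if (i, j) == missing else 0.0 for j in range(b)]
         for i in range(t)]

    def residuals(z):
        # Residuals after removing treatment (row) and block (column) means.
        grand = sum(map(sum, z)) / (t * b)
        row = [sum(r) / b for r in z]
        col = [sum(z[i][j] for i in range(t)) / t for j in range(b)]
        return [[z[i][j] - row[i] - col[j] + grand for j in range(b)]
                for i in range(t)]

    ry, rx = residuals(y), residuals(x)
    e_yx = sum(ry[i][j] * rx[i][j] for i in range(t) for j in range(b))
    e_xx = sum(rx[i][j] ** 2 for i in range(t) for j in range(b))
    slope = e_yx / e_xx                      # error regression of y on x
    return -slope                            # estimate of the missing value


# Hypothetical data: 4 treatments (rows) in 3 blocks (columns); the cell
# for treatment 1 in block 1 is missing (its stored value is a placeholder).
data = [[12, 15, 11],
        [14, 0, 13],
        [10, 13, 9],
        [16, 18, 15]]
estimate = bartlett_missing_value(data, (1, 1))
```

For these data the result agrees with the familiar missing-plot formula (tT + bB - G)/((t - 1)(b - 1)), where T and B are the observed totals of the treatment and block containing the missing unit and G is the grand total of the observed values.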

Still another use for covariance analysis as pointed out by Bartlett 
[2] is to adjust treatment effects for systematic differences between 
experimental units in a non-random experiment. In this situation if 
the effects of treatment differences disappear after adjusting for initial 
differences, then one can conclude that the treatments do not differ. 
However, if after adjusting for initial differences among experimental 
units the treatment differences still remain, then the conclusion that 
the treatments actually differ is not necessarily a valid conclusion. 
Although an exact functional relationship between the dependent and 
independent variates may not be known, an approximate relationship 
may be sufficient to adjust for the effects of non-randomness. Thus, if 
differences among treatments do not disappear after making such an 
adjustment, then we may conclude (i) treatments actually differ or (ii) 
treatments do not differ, but the functional relationship is not known 
to a sufficient approximation to adjust adequately for the non-random 
nature of the experiment. The conclusion that treatments actually 
differ will only be valid if the explicit form of the adjustment makes 
use of a known functional relationship between the dependent and 
independent variates. 

In the simplest covariance situation the experimenter applies treat- 
ments to experimental units and takes measurements (denoted by y) 
which he wishes to analyze. However, associated with each experimental 
unit is an uncontrolled factor, sometimes referred to as a concomitant 
variate, which can be measured or observed at a value x. 
The value x is completely independent of the treatments subsequently 
applied, but does reflect the variation among the experimental units 
with respect to uncontrolled factors. If the response y of the experi- 
mental unit to a treatment is affected by the particular value x asso- 
ciated with an uncontrolled factor, then it would be to the experimenter’s 
advantage to “adjust” the measurement for the uncontrolled factor. 
This “adjustment” may take the form of a linear correction to the 
treatment estimates. That is, if the measurement y varies linearly 
with x, then y - Bx is the adjusted value for the measurement, where 
B is a constant which can be estimated from the present data and need 
not depend on other measurements. 
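
The linear adjustment can be sketched for the simplest case, a completely randomized experiment with one concomitant variate. The function name and the data below are hypothetical; the slope is the usual pooled within-treatment regression of y on x, and each treatment mean is adjusted to the overall mean of x, which is one common way of applying the correction and is assumed here rather than quoted from the text.

```python
def adjusted_treatment_means(groups):
    """groups: one list of (x, y) pairs per treatment.

    Returns the pooled within-treatment slope and the treatment means of y
    adjusted to the overall mean of x.
    """
    n = sum(len(g) for g in groups)
    x_bar = sum(x for g in groups for x, _ in g) / n
    e_xx = e_xy = 0.0
    means = []
    for g in groups:
        mx = sum(x for x, _ in g) / len(g)
        my = sum(y for _, y in g) / len(g)
        means.append((mx, my))
        # Within-treatment (error) sums of squares and products.
        e_xx += sum((x - mx) ** 2 for x, _ in g)
        e_xy += sum((x - mx) * (y - my) for x, y in g)
    slope = e_xy / e_xx
    return slope, [my - slope * (mx - x_bar) for mx, my in means]


# Hypothetical error-free data: y = a_i + 2x with a_0 = 1 and a_1 = 4.
groups = [[(1, 3), (2, 5), (3, 7)],
          [(2, 8), (4, 12), (6, 16)]]
slope, adjusted = adjusted_treatment_means(groups)
```

In this constructed example the unadjusted means differ by 7, whereas the adjusted means differ by 3, the true difference between the two treatment constants.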

Many of the published applications of covariance analysis have 
dealt with experimental situations where the measurements conformed 
to a completely randomized design or to randomized block designs, 
cf. Anderson and Bancroft [1], Snedecor [18], Wishart [21]. The use 
of the analysis of covariance applied to split-plot experiment designs has 
been discussed by Bartlett [3], Cochran [7], Kempthorne [15, section 
19.7], and Truett and Smith [19]. A monograph by Cochran, Cox, and 
Eckhardt [8] gives both the inter- and intra-block covariance analysis 
for triple lattices. Cornish [9] has discussed the intra-block analysis 
applied to balanced incomplete block designs. Federer [11] has sum- 
marized the use of covariance methods applied to completely randomized 
and randomized block designs, Latin square designs, split plot and 
split block designs, and various lattice designs. The book by Rao [17] 
derives the general equations for covariance which are applicable to 
any experiment design. 

This paper, in Section 2, reviews briefly the procedure for the intra- 
block analysis of covariance when applied to any type of experimental 
design for p concomitant variates. Section 3 deals with the recovery 
of inter-block information; section 4 gives formulae for application to 
balanced and partially balanced (2 associate classes) incomplete block 
designs; section 5 treats a more general case of covariance analysis 
where the adjustment for the treatment response depends on the differ- 
ential block response; and finally section 6 consists of a numerical 
example illustrating the covariance analysis computations for a balanced 
incomplete block design. 


2. INTRA-BLOCK ANALYSIS 


In many of the expositions and applications of the analysis of 
covariance it “appears” that different formulae are needed for different 
experiment designs. This is only illusory. Actually, the formulae for 
applying the analysis of covariance to various experiment designs are the 
same for all designs. The results in this section hold for any experiment 
design or two-way non-orthogonal classification and are a straightforward 
consequence of the general linear hypothesis. These results 
were first obtained by Rao [17] and are included for completeness. 
Consider an experimental plan having v treatments, arranged in b 
blocks, such that each block contains k experimental units and each 
treatment occurs in r blocks. Let the measurement for the ith treatment 
in the jth block be $y_{ij}$, and let there be p concomitant variates 
denoted by $x_{ij}^{(w)}$ (w = 1, 2, ..., p) associated with each measurement. 
Then the mathematical model underlying the intra-block analysis is 


(2.1)  $y_{ij} = \mu + t_i + b_j + \sum_w B_w x_{ij}^{(w)} + e_{ij}$

where $\mu$ is a constant common to all measurements, $t_i$ is the systematic 
effect for the ith treatment, $b_j$ is the systematic effect for the jth block, 
$B_w$ is the regression of $y_{ij}$ on $x_{ij}^{(w)}$ for (w = 1, 2, ..., p) and $e_{ij}$ is a 
random variable having zero expectation and constant but unknown 
variance $\sigma^2$. In making all tests of significance it will be further assumed 
that the $e_{ij}$ are independent identically distributed random variables 
following a normal distribution. 

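2.1 Notation: It will be convenient to adopt the following notation 
where it will be understood that a summation with respect to i ranges 
from i = 1, 2, ..., v; a summation with respect to j ranges from j = 
1, 2, ..., b; and a summation with respect to u or w ranges from 
u, w = 1, 2, ..., p. 

Let 

$n_{ij}$ = 1 if treatment i appears in the jth block, 0 otherwise; 

(2.2) 
$Y_{i.} = \sum_j n_{ij} y_{ij}$,  $\bar{y}_{i.} = Y_{i.}/r$  (treatment total and average) 
$Y_{.j} = \sum_i n_{ij} y_{ij}$,  $\bar{y}_{.j} = Y_{.j}/k$  (block total and average) 
$Y_{..} = \sum_i \sum_j n_{ij} y_{ij}$,  $\bar{y}_{..} = Y_{..}/bk$  (grand total and average) 

and similar expressions for $X_{i.}^{(w)}$, $X_{.j}^{(w)}$, $X_{..}^{(w)}$; $\bar{x}_{i.}^{(w)}$, $\bar{x}_{.j}^{(w)}$, $\bar{x}_{..}^{(w)}$. 
Also let 

(2.3) 
$Q_{i0} = Y_{i.} - \sum_j n_{ij} \bar{y}_{.j}$ 
$Q_{iw} = X_{i.}^{(w)} - \sum_j n_{ij} \bar{x}_{.j}^{(w)}$ 
(adjusted treatment totals: treatment totals adjusted by block means) 

$\hat{t}_i$ = estimate for the ith treatment obtained from intra-block analysis; 
$\hat{t}_{i0}$ = estimate for the ith treatment obtained from intra-block analysis 
if $B_w$ = 0 for w = 1, 2, ..., p; 
$\hat{t}_{iw}$ = quantity analogous to $\hat{t}_{i0}$, obtained by using $x_{ij}^{(w)}$ in place of $y_{ij}$. 

(2.4) 
$T_{00} = \sum_i \hat{t}_{i0} Q_{i0}$ 
$T_{0w} = \sum_i \hat{t}_{i0} Q_{iw}$ 
$T_{uw} = \sum_i \hat{t}_{iu} Q_{iw}$ 
(adjusted treatment sum of squares and products) 

(2.5) 
$D_{00} = \sum_j Y_{.j}^2/k - Y_{..}^2/bk$ 
$D_{0w} = \sum_j Y_{.j} X_{.j}^{(w)}/k - Y_{..} X_{..}^{(w)}/bk$ 
$D_{uw} = \sum_j X_{.j}^{(u)} X_{.j}^{(w)}/k - X_{..}^{(u)} X_{..}^{(w)}/bk$ 
(block sum of squares and products) 

(2.6) 
$S_{00} = \sum_i \sum_j n_{ij} y_{ij}^2 - Y_{..}^2/bk$ 
$S_{0w} = \sum_i \sum_j n_{ij} y_{ij} x_{ij}^{(w)} - Y_{..} X_{..}^{(w)}/bk$ 
$S_{uw} = \sum_i \sum_j n_{ij} x_{ij}^{(u)} x_{ij}^{(w)} - X_{..}^{(u)} X_{..}^{(w)}/bk$ 
(total sum of squares and products) 

(2.7) 
$E_{00} = S_{00} - D_{00} - T_{00}$ 
$E_{0w} = S_{0w} - D_{0w} - T_{0w}$ 
$E_{uw} = S_{uw} - D_{uw} - T_{uw}$ 
(error sum of squares and products) 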

2.2 Analysis: The estimates of the unknown parameters $\mu$, $t_i$, $b_j$, $B_w$ 
can be found by minimizing the sum of squares of the errors 
with respect to each parameter. On differentiating the sum of squares 
of the errors and using the constraints $\sum_i t_i = \sum_j b_j = 0$, the reduced 
normal equations for the treatment and regression coefficient estimates 
can be written as 


(2.8)  $r\left(1 - \frac{1}{k}\right)\hat{t}_i - \frac{1}{k}\sum_{s \neq i} \lambda_{is}\hat{t}_s + \sum_w \hat{B}_w Q_{iw} = Q_{i0}$  (i = 1, 2, ..., v) 
(2.9)  $\sum_u \hat{B}_u E_{uw} = E_{0w}$  (w = 1, 2, ..., p) 


where $\lambda_{is} = \sum_j n_{ij} n_{sj}$ = number of blocks in which treatments i and 
s appear together. Hence, the solution for the $\hat{t}_i$ can be alternatively 
written as 

(2.10)  $\hat{t}_i = \hat{t}_{i0} - \sum_w \hat{B}_w \hat{t}_{iw}$ 

where $\hat{t}_{i0}$ is the treatment estimate if all $B_w$ = 0. The $\hat{t}_{iw}$ are a similar 
set of quantities obtained by replacing $Q_{i0}$ by $Q_{iw}$ for w = 1, 2, ..., p. 
Further, by some elementary algebra, we can demonstrate that $\hat{t}_{i0}$ 
and $\hat{B}_w$ are uncorrelated. Thus, the procedures for obtaining the estimates 
of the adjusted treatment effects (adjusted or corrected for the 
effects of the concomitant variates) are: first, find the usual treatment 
estimates ($\hat{t}_{i0}$) neglecting the presence of the concomitant variates; 
second, repeat the calculations with the same formulae, but using the 
concomitant variates $x_{ij}^{(w)}$ (w = 1, 2, ..., p) in place of $y_{ij}$, and thus 
obtain the quantities $\hat{t}_{iw}$. Substitution in equation (2.10) gives the 
final result. 
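
The two-step procedure can be sketched in Python for a single concomitant variate (p = 1) in a balanced incomplete block design. The function name, the record layout, and the data are hypothetical. The solution of the reduced normal equations is taken to be k Q / (lam v), the standard textbook result for a balanced design in which every pair of treatments occurs together in lam blocks; this is an assumption brought in for the sketch. The error regression coefficient is computed from the within-block sums of products less the adjusted treatment sums of products.

```python
def bib_intrablock_ancova(obs, v, b, k, lam):
    """Intra-block covariance adjustment, one covariate, balanced design.

    obs is a list of (treatment, block, y, x) records with 0-based labels.
    Returns (slope, unadjusted estimates, adjusted estimates).
    """
    def totals(col):
        treat, block = [0.0] * v, [0.0] * b
        for rec in obs:
            treat[rec[0]] += rec[col]
            block[rec[1]] += rec[col]
        return treat, block

    def adjusted_totals(col):
        # Q_i: treatment total minus the means of the blocks containing i.
        q, block = totals(col)
        for rec in obs:
            q[rec[0]] -= block[rec[1]] / k
        return q

    def within_block_sp(c1, c2):
        # Sum of products within blocks (total minus unadjusted blocks).
        b1, b2 = totals(c1)[1], totals(c2)[1]
        raw = sum(rec[c1] * rec[c2] for rec in obs)
        return raw - sum(p * q for p, q in zip(b1, b2)) / k

    q0, q1 = adjusted_totals(2), adjusted_totals(3)
    # Assumed balanced-design solution of the reduced normal equations.
    t0 = [k * q / (lam * v) for q in q0]      # step 1: estimates ignoring x
    t1 = [k * q / (lam * v) for q in q1]      # step 2: same formula on x
    e01 = within_block_sp(2, 3) - sum(a * q for a, q in zip(t0, q1))
    e11 = within_block_sp(3, 3) - sum(a * q for a, q in zip(t1, q1))
    slope = e01 / e11
    adjusted = [a - slope * c for a, c in zip(t0, t1)]
    return slope, t0, adjusted


# Hypothetical error-free data: v = 4 treatments in b = 4 blocks of k = 3,
# each pair of treatments occurring together in lam = 2 blocks.
blocks = [(0, 1, 2), (0, 1, 3), (0, 2, 3), (1, 2, 3)]
x_vals = [1.0, 4.0, 2.0, 3.0, 7.0, 1.5, 0.5, 2.5, 6.0, 5.0, 3.5, 8.0]
true_t = [1.0, -2.0, 0.5, 0.5]
true_b = [0.3, -1.0, 2.0, -1.3]
records, n = [], 0
for j, members in enumerate(blocks):
    for i in members:
        x = x_vals[n]
        records.append((i, j, 10.0 + true_t[i] + true_b[j] + 0.7 * x, x))
        n += 1
slope, unadjusted, adjusted = bib_intrablock_ancova(records, 4, 4, 3, 2)
```

Because the constructed responses contain no error term, the adjusted estimates reproduce the treatment effects used to generate them and the slope reproduces the generating coefficient, while the unadjusted estimates do not.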

The corresponding residual sum of squares after adjusting for the 
effects of the concomitant variates is 


(2.11)  $S_e = E_{00} - \sum_w \hat{B}_w E_{0w}$ 


having $n_e = (bk - b - v - p + 1)$ degrees of freedom. If the concomitant 
variates had been completely neglected, then the residual 
sum of squares would be taken as $E_{00}$, which would be an inflated 
value of the residual sum of squares. 

In the analysis of variance it is common to test the null hypothesis 
that all treatment effects are the same, i.e. $H_0$($t_1 = t_2 = ... = t_v = 0$). 
This can be tested by the F-ratio having (v - 1) and $n_e$ degrees of 
freedom, 


(2.12)  $F = \frac{S_t/(v - 1)}{S_e/n_e}$ 

where the treatment sum of squares, $S_t$, is 

(2.13)  $S_t = T_{00} + \sum_w \hat{B}_w E_{0w} - \sum_w \tilde{B}_w [E_{0w} + T_{0w}]$ 

and $\tilde{B}_w$ satisfies the linear equations 

(2.14)  $\sum_u \tilde{B}_u [E_{uw} + T_{uw}] = E_{0w} + T_{0w}$  (w = 1, 2, ..., p). 


A test for the null hypothesis that the concomitant variates have 
no effect, i.e. $H_0$($B_1 = B_2 = ... = B_p = 0$) can be made using the 
F-ratio 


(2.15)  $F = \frac{\left(\sum_w \hat{B}_w E_{0w}\right)/p}{S_e/n_e}$ 


having p and $n_e$ degrees of freedom. 


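The variance of the difference between two treatment estimates 
$\hat{t}_a$ and $\hat{t}_b$ (say) is given by 

(2.16)  $\mathrm{var}(\hat{t}_a - \hat{t}_b) = \mathrm{var}(\hat{t}_{a0} - \hat{t}_{b0}) + \sigma^2 \sum_u \sum_w (\hat{t}_{au} - \hat{t}_{bu})(\hat{t}_{aw} - \hat{t}_{bw}) c_{uw}$ 

where $\mathrm{var}(\hat{t}_{a0} - \hat{t}_{b0})$ is the variance between the treatment estimates 
ignoring the concomitant information and $c_{uw}$ are the elements of the 
inverse matrix to $||E_{uw}||$. 

The variances and covariances for the $\hat{B}_w$ are 

(2.17)  $\mathrm{var}(\hat{B}_w) = c_{ww}\sigma^2$,  $\mathrm{cov}(\hat{B}_u, \hat{B}_w) = c_{uw}\sigma^2$. 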

The case most often met in practice is when there is only one 
concomitant variate, i.e. p = 1. The computational procedure for this 
case is illustrated by a numerical example in section 6. 


3. INTER-BLOCK ANALYSIS 


In the case of incomplete block designs (designs in which the number 
of treatments is larger than the number of experimental units in a 
block, i.e. v > k) additional information can be recovered from the 
block totals provided one can assume that the block effects are random 
variables. One can use this additional inter-block information in two 
ways; e.g. (1) to find inter-block estimates of the treatments and regres- 
sion coefficients which are independent of the intra-block estimates or 
(2) to find combined estimates which depend on both the intra- and 
inter-block information. Section 3.2 gives the formulae for finding the 
inter-block estimates and also shows how it may be possible to duplicate 
all statistical tests made in the intra-block analysis. Section 3.4 gives 
the necessary formulae for obtaining the combined estimates. These 
are more precise than if either the intra- or inter-block estimates had 
been used alone. 

With respect to the non-covariance situation, most statisticians agree 
that the inter-block analysis may be important if the number of blocks 
is “large” or if the variability between blocks is “small.” However, 
in the analysis of covariance, the inter-block analysis will be important 
if the variability of the concomitant variate is larger for “between 
blocks” as compared to the variability “within blocks.” This situation 
may allow more precise inter-block estimates of the regression coefficients 
as compared to the corresponding intra-block estimates. 


3.1 Notation 


A block total (summing equation (2.1) over i) will have the underlying 
mathematical model 


(3.1)  $Y_{.j} = k\mu + \sum_i n_{ij} t_i + k \sum_w B_w \bar{x}_{.j}^{(w)} + \sum_i n_{ij} e_{ij} + k b_j$ 


where $b_j$ and $e_{ij}$ are assumed to be mutually uncorrelated random 
variables such that 


$E(e_{ij}) = 0$,  $\mathrm{var}(e_{ij}) = \sigma^2$ 
$E(b_j) = 0$,  $\mathrm{var}(b_j) = \sigma_b^2$ 


(For all tests of significance it is further assumed that $e_{ij}$ and $b_j$ are 
normally distributed variates). 

The model (3.1) assumes that the regression of $Y_{.j}$ on $X_{.j}^{(w)}$ is $B_w$, 
which is the same regression as the regression of $y_{ij}$ on $x_{ij}^{(w)}$ described 
in the intra-block analysis. Some statisticians, however, have advocated 
a more general model which allows the intra-block regression coefficients 
to be different from the inter-block regression. In this paper, all models 
are such that the intra-block regression is the same as that for the 
inter-block regression. It is difficult for this writer to visualize situations 
allowing separate regressions. That is, if the regression of y on x is B, 
it seems that the regression of their averages should also be the same. 
If the two independent (intra- and inter-block) estimates of the respective 
regression coefficients differ by more than their indicated precisions, 
this is evidence of the inapplicability of the model, rather than 
evidence that there are separate intra- and inter-block regressions. 

It will be convenient to denote by a prime (') quantities derived 
for the inter-block analysis that have counterparts in the intra-block 
analysis. Thus, 


(3.2) 
$t'_i$ = estimate for the ith treatment obtained only from inter-block analysis. 
$B'_w$ = estimate for $B_w$ obtained from the inter-block analysis. 
$t'_{i0}$ = estimate for the ith treatment obtained from the inter-block 
analysis if $B_w$ = 0 for w = 1, 2, ..., p. 
$t'_{iw}$ = quantity obtained by using $x_{ij}^{(w)}$ in place of $y_{ij}$ and solving 
for $t'_{i0}$. 
Also let 
Qo = Yi. Qin — = 


$E'_{00} = D_{00} - T'_{00}$,  $E'_{0w} = D_{0w} - T'_{0w}$,  $E'_{uw} = D_{uw} - T'_{uw}$ 


Since both the intra- and inter-block analyses yield independent 
treatment estimates, these estimates can be combined to form a single 
estimate having a smaller variance than either the intra- or inter-block 
estimates. A bar (¯) is used to denote those quantities which occur in 
the combined estimate. Thus 


(3.3) 


(3.4) 
$\bar{t}_i$ = combined estimate for treatment i. 
$\bar{t}_{i0}$ = combined treatment estimate for treatment i if $B_w$ = 0 
for (w = 1, 2, ..., p). 
$\bar{B}_w$ = combined estimate for $B_w$. 

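Also let 


(3.5)  $W = \frac{1}{\sigma^2}$,  $W' = \frac{1}{\sigma^2 + k\sigma_b^2}$ 

and define 


(3.6) 
$\bar{Q}_{i0} = W Q_{i0} + W' Q'_{i0}$ 
$\bar{Q}_{iw} = W Q_{iw} + W' Q'_{iw}$ 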
| Eu = WEw + 
3.7) 


Eow WEow + Ww’ 
WE... + . 
3.2 Estimates based only on inter-block information 


The estimates are obtained by minimizing the residual sum of 
squares $(1/k) \sum_j [k b_j + \sum_i n_{ij} e_{ij}]^2$ with respect to each of the unknown 
parameters. This results in the normal equations 


(3.9) = w= 1,2,--- ,p. 


(3.8) pat UB. = 1,2, 
from which the $t'_i$ can be written as 

(3.10)  $t'_i = t'_{i0} - \sum_w B'_w t'_{iw}$. 


The residual sum of squares will have $n'_e = (b - v - p)$ degrees of 
freedom and is given by 

(3.11)  $S'_e = E'_{00} - \sum_w B'_w E'_{0w}$. 

$S'_e$ will be independent of $S_e$, has the expected value $E(S'_e) = n'_e(\sigma^2 + k\sigma_b^2)$, 
and follows a $(\sigma^2 + k\sigma_b^2)\chi^2$ distribution with $n'_e$ degrees of freedom. 

Thus all results derived for the intra-block analysis can be found 
for the inter-block analysis. Furthermore, the parameter estimates 
$t'_i$, $B'_w$ are independent of the corresponding intra-block estimates. 
The variance of the difference between two treatment estimates $t'_a$ and 
$t'_b$ (say) is 

(3.12)  $\mathrm{var}(t'_a - t'_b) = \mathrm{var}(t'_{a0} - t'_{b0}) + (\sigma^2 + k\sigma_b^2) \sum_u \sum_w (t'_{au} - t'_{bu})(t'_{aw} - t'_{bw}) c'_{uw}$ 

where $c'_{uw}$ are the elements of the inverse matrix to $||E'_{uw}||$. 

Furthermore all tests of significance made in the intra-block analysis 
can be duplicated using only the inter-block estimates. Thus the 
equations (2.12)-(2.17) carry over directly by substituting primed (') 
quantities for circumflex (^) quantities, $n'_e = b - p - v$ for $n_e$, and 
$(\sigma^2 + k\sigma_b^2)$ for $\sigma^2$. In addition, all of these inter-block tests of significance 
are independent of their intra-block counterparts. 

Thus, from the b block totals we can estimate (p + v) parameters, 
i.e., the grand mean $\mu$, the parameters $t_i$ (i = 1, 2, ..., v, of which 
v - 1 are independent) and the regression coefficients $B_w$ (w = 1, 
2, ..., p). Hence, the number of blocks must be equal to or greater 
than the total number of parameters, i.e. $b \geq (v + p)$. Furthermore, if 
the blocks can be arranged in m replication groups, these (m - 1) degrees 
of freedom can only be used for estimating the regression coefficients 
$B_w$ and the variability between blocks. Therefore, a necessary 
and sufficient condition for estimating all (p + v) parameters is that 


$b - v \geq$ maximum (m - 1, p). 


In order to obtain an estimate of $(\sigma^2 + k\sigma_b^2)$ and thus be able to duplicate 
all tests in the inter-block analysis, it is necessary that the inequality 
sign hold or that m > 1. Further details can be found in [22]. 


3.3 The combination of significance tests 


Since there exist two independent variance-ratio tests for testing 
any linear hypothesis concerning the treatments or regression coefficients, 
it seems desirable to combine the two tests into a single test. A proposed 
procedure is to let $P_i$ (i = 1, 2) refer to the probabilities that the calculated 
$F_i$ value will be exceeded under the null hypothesis, i.e. 


(3.13)  $P_i = P\{F > F_i \mid H_0\}$,  i = 1, 2 


where the subscripts 1 and 2 refer to the intra- and inter-block analyses 
respectively. Then a combined test can be made using the statistic 


(3.14) C = P,P; 
where 

(3.15) Eo + ke?’ 


and E refers to the efficiency of the experiment design. (The null 
hypothesis is rejected with level of significance $\alpha$ if $C < C_\alpha$). In most 
situations o°/(o” + ko?) will be replaced by its estimate s?/s:-. Then 
the test will only be an approximate one. The power of this test is 
investigated in [23]. A table of critical values of $C_\alpha$ for $\alpha$ = .01, .05 is 
given in [22]. Other methods for combining significance tests are dis- 
cussed by Birnbaum [4], Mosteller and Bush [16], and Wallis [20]. 


3.4 Combined estimates 


The most precise treatment estimates are obtained by weighting 
both the intra- and inter-block estimates by the reciprocal of their 
respective variances. This results in estimates of the form 

(3.16)  $\bar{t}_i = \bar{t}_{i0} - \sum_w \bar{B}_w \bar{t}_{iw}$ 

where the $\bar{B}_w$ satisfy the equations 

(3.17)  $\sum_u \bar{B}_u \bar{E}_{uw} = \bar{E}_{0w}$  (w = 1, 2, ..., p). 

In the practical situation the weights 

$W = 1/\sigma^2$,  $W' = 1/(\sigma^2 + k\sigma_b^2)$ 

are not known and have to be estimated. For this purpose we have 
estimates of $\sigma^2$ and $\sigma^2 + k\sigma_b^2$ which follow independent $\sigma^2\chi^2$ and 
$(\sigma^2 + k\sigma_b^2)\chi^2$ distributions respectively when $b - v > \max(p, m - 1)$ or 
when m > 1. If the inter-block analysis is made without this condition 
holding, then no estimate of $(\sigma^2 + k\sigma_b^2)$ independent of $S_e$ is available. 
Then one can determine an estimate of $(\sigma^2 + k\sigma_b^2)$, which is not independent 
of $S_e$, by an easily apparent modification of the usual analysis 
of variance formulae applied to the recovery of inter-block information, 
cf. Federer [11]. 
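
The weighting principle can be illustrated for a single scalar quantity. The sketch below is a simplification under stated assumptions: it combines two independent estimates whose variances are taken as known, whereas in the analysis described in the text the weights enter through the combined normal equations and must themselves be estimated. The function name and the numbers are hypothetical.

```python
def combine_independent_estimates(est_intra, var_intra, est_inter, var_inter):
    """Weight two independent estimates of one quantity by reciprocal variances.

    Returns the combined estimate and its variance, 1 / (W + W').
    """
    w, w_prime = 1.0 / var_intra, 1.0 / var_inter
    combined = (w * est_intra + w_prime * est_inter) / (w + w_prime)
    return combined, 1.0 / (w + w_prime)


# Hypothetical values: intra-block estimate 2.0 with variance 1.0,
# inter-block estimate 5.0 with variance 2.0.
combined, variance = combine_independent_estimates(2.0, 1.0, 5.0, 2.0)
```

The variance of the combined estimate is smaller than either of the two component variances, which is the sense in which the combined estimates are the most precise.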


4. FORMULAE FOR APPLICATION TO BALANCED AND PARTIALLY 
BALANCED INCOMPLETE BLOCK DESIGNS 


For convenience, the quantities $\hat{t}_{i0}$, $t'_{i0}$, $\bar{t}_{i0}$, the variance of a difference, 
and the efficiency factor E for the balanced and partially balanced 
designs (two associate classes) are listed below. 


Balanced incomplete block designs 


(4.1) 

(4.2) = 

(4.4) 

(4.5) var — fo) = 


Partially balanced incomplete block designs 


(4.7) 


nk — 1) + + 


(48) thy = + + 


1 
(4.9) lio = — + [kQio + d,S8,(Qio) + d,8.(Qio)], 


(v — 1)(k — 1) 
nik — ¢,) + n(k — 


where S,,(Qio), Sm(Q%o); Sm(Qio) is the sum of the mth associates Qo , 
Q’. , Qio for treatment 7; , cf , ci are defined by 


(4.10) E= 


(4.11) (m= 1,2) 


and the quantities c,, , d,, , , A, and H are the usual parameters 
for partially balanced designs, cf. Bose and Shimamoto [5] or Bose, 
Clatworthy, and Shrikhande [6]. 
If a and b are mth associates, then 


(4.12) var (lo — = 
(4.13) var — th) = 2(o" + — cn) 


For some partially balanced designs, it will not be possible to estimate 
all v - 1 treatment contrasts. This will occur when the denominator 
of (4.11) is zero, i.e. $A - rH + r^2 = 0$. When this condition 
holds, it implies that there exist (say) s orthogonal contrasts among the 
block totals, having expectations independent of the treatments and 
s > b - v. These s contrasts can be used for estimating the p regression 
coefficients (provided s > p), and $\sigma^2 + k\sigma_b^2$. The remaining (b - s) 
- 1 contrasts among the blocks will have expectations dependent on 
the treatments. However, the number of orthogonal treatment contrasts 
is v - 1 which is greater than the (b - s) - 1 contrasts among 
the block totals available for estimating all the treatment contrasts, 
i.e. v > b - s. Therefore not all treatment contrasts will be estimable. 
This will be especially true for the singular and semi-regular group 
divisible designs. 


5. ANALYSIS OF COVARIANCE WHEN THE REGRESSION 
COEFFICIENTS DEPEND ON THE BLOCKS 

All the techniques for the analysis of covariance discussed in previous 
sections were based on the assumption that the regression coefficients 
are independent of the blocks. In some experimental situations this 
assumption is not satisfied. This section outlines the appropriate 
covariance analysis for one concomitant variate when the regression 
coefficient is dependent on the block. 

This model coincides with that suggested by C. P. Cox [10] where 
the mathematical model (excluding the $t_i$) can be regarded as a curved 
sheet representing the background of the experimental material in 
three dimensions. If the concomitant variate represents rows and the 
blocks represent columns, the indicated model would allow the columns 
to be discrete and the variation within them represented by linear 
regressions. 

Let the mathematical model underlying the measurement for the 
ith treatment in the jth block be 


(5.1)  $y_{ij} = \mu + t_i + b_j + B_j (x_{ij} - \bar{x}_{.j}) + e_{ij}$ 


where the quantities $\mu$, $t_i$, and $b_j$ have the same meaning as in previous 
sections, but $B_j$ (j = 1, 2, ..., b) is a regression coefficient associated 
with the jth block. 

The normal equations are obtained by differentiating the residual 
sum of squares 


with respect tou, ¢; ,b;,and B;. Using the restraints =0, 
the normal equations take the form 
(5.2) =A 
(53) =raté) + + — 
i 


+ B, — 


(5.5) 


~ 


Substituting (5.2) and (5.4) in (5.3) one can write (5.3) as 


+ G=1,2,---,0 


where q;; = — #.;). 
Define the quantities 7,; by 


Note that the 7;; are the quantities calculated from (5.6) by replacing 
Qo by q;; and setting B; = 0 for all j. 
Then by a rearrangement the ¢; can be written as 


(5.8) = bin — 


where é,;. are as defined in section 2. Substituting (5.8) in (5.5) results 
in the equations for B; written in the simple form 


(5.9) BE;; — 8.7,;=E;, G=1,2,--:,b) 
where 


t 


(5.10) 


The variance of the difference between two treatment estimates is 
A A A A 2 
(5.11) var (¢, — t,) = var (tay — toy) + (Fan — — Fai 


where c,; are the elements of the inverse matrix arising from the set of 
linear equations given by (5.9). 

The computations can conveniently be arranged in an analysis of 
variance table as shown in Table 1. 

The test of significance for the null hypothesis that all treatment 
effects are zero, i.e. H(t; = t, = +--+ = t, = 0) is given by the F-ratio 
F = s7/s; having (v — 1) andn, = (bk — 2b — v + 1) degrees of freedom. 

A test for the null hypothesis that the regression coefficients do not 
depend on the blocks is given by the F-ratio F = s?/s? having (b — 1) 
and n, degrees of freedom. 


6. EXAMPLE WITH ONE CONCOMITANT VARIATE 


The usual application of the analysis of covariance, as first described 
by Fisher, is to adjust the final estimates by means of a concomitant 
set of data in order to improve the precision of the final estimates. 
However, another use for the analysis of covariance (or analysis of 
covariance type calculations) is to employ a general regression model 
to relate a dependent variable in terms of both qualitative and quantitative 
variables. This section contains such an example. 

A certain film-type composition resistor used in electronic equipment 
is of the type which is mounted on a small ceramic plate. An investiga- 
tion was recently conducted at the National Bureau of Standards to 
determine the effects of four different geometrical shapes on the current 
noise of these resistors. These geometric shapes were taken to be
rectangular parallelepipeds (all having the same thickness) formed by
taking all four combinations of 2 widths ($w_1$, $w_2$) and 2 lengths ($l_1$, $l_2$).
For convenience the four combinations of length and width, $l_1w_1$,
$l_1w_2$, $l_2w_1$, $l_2w_2$, will be denoted by a, b, c, and d respectively.

Since for this type of resistor the noise of an individual resistor 


varies with its resistance, an appropriate mathematical model underlying 
the measurements is 


$y_{ij} = u + t_i + b_j + B x_{ij} + e_{ij}$


[TABLE 1: analysis of covariance table; entries illegible in the source.]


where $y_{ij}$ is the logarithm of the noise measurement of the ith type
resistor (treatment) in the jth plate (block), $x_{ij}$ is the logarithm of the
resistance for this resistor, $u$ is a constant common to all observations,
$t_i$ is the effect of the ith treatment, and $B$ is a constant of proportionality
between "noise" and "resistance". Furthermore, the plate effects
$\{b_j\}$ and the $\{e_{ij}\}$ are assumed to be sequences of independent random
variables following normal distributions with mean zero and variances
$\sigma_b^2$ and $\sigma^2$ respectively.

This experimental situation does not conform to the usual assump- 
tions underlying a covariance analysis, as here, the concomitant variate 
(resistance) depends partly on the shape of the resistor. However, 
from previous work it is known that a linear relation exists between 
noise and resistance (both on logarithm scales). The purpose of this 
experiment was to determine if the intercept of this relation varies 
with the geometry of the resistor. This is equivalent to testing whether
all treatment effects $t_i$ are equal to zero. Thus, the fact that the value
of the resistance depends partly on the shape of the resistor in no way
invalidates the test that the intercept $(u + t_i)$ varies with the shape
of the resistor.

Since only three different resistors can be mounted on any one 
ceramic plate, a balanced incomplete block design involving 36 resistors 
mounted on 12 plates was used for the basic experimental plan. The 
plates determine a “natural” grouping of the experimental material and 
hence can be regarded as constituting the ‘‘blocks’’ in the experiment. 
The parameters of the experimental design are

$v = 4$, $k = 3$, $b = 12$, $r = 9$, $\lambda = 6$.

Table 2A summarizes the coded measurements and gives the experimental
plan. The four treatments are denoted by the letters a, b, c, d
and the numbers in parentheses represent the values for the concomitant
variate.

The values for $Q_{iz}$, $Q'_{iz}$, $\hat t_{iz}$ for z = x, y are summarized in Table
2B where the treatment estimates, ignoring the concomitant variate,
are obtained using equations (4.1) and (4.2) e.g.

(6.1)        (intra-block)
             (inter-block)
TABLE 2A

EXPERIMENTAL PLAN

Observations and experimental plan

Blocks
   1      a 1.11 (.98)      c  .95 (.66)      d  .82 (.47)
   2      b 1.22 (.88)      d  .97 (.58)      a 1.70 (1.14)
   3      c 1.52 (.79)      a 1.60 (1.08)     b 1.11 (.84)
   4      d 1.18 (.58)      b 1.22 (.88)      c 1.54 (.80)
   5      a 1.57 (1.08)     c 1.33 (.79)      d  .93 (.49)
   6      b 1.12 (.82)      d  .93 (.51)      a 1.69 (1.11)
   7      c 1.26 (.69)      a 1.35 (1.00)     b 1.06 (.80)
   8      d 1.12 (.48)      b 1.09 (.79)      c 1.29 (.70)
   9      a 1.60 (1.12)     c 1.82 (.82)      d 1.09 (.57)
  10      b 1.03 (.80)      d  .94 (.52)      a 1.66 (1.10)
  11      c 1.24 (.71)      a 1.31 (1.00)     b 1.11 (.82)
  12      d  .93 (.47)      b 1.09 (.78)      c 1.27 (.68)


TABLE 2B

TREATMENT ESTIMATES IGNORING CONCOMITANT VARIATE

                          Intra-block                               Inter-block
Treatment     Q_iy         t_iy         Q_ix         t_ix         t'_iy        t'_ix
    a        2.243 333     .280 417     2.220 000     .277 500     .154 167     .307 500
    b       -1.133 333    -.141 667     0.293 333     .036 667    -.009 167     .034 167
    c        1.050 000     .131 250    -0.316 667    -.039 583    -.022 500    -.125 833
    d       -2.160 000    -.270 000    -2.196 667    -.274 583    -.122 500    -.215 833
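
The intra-block columns of Table 2B can be checked from the coded observations of Table 2A. The sketch below assumes the usual balanced incomplete block definitions: $Q_i$ is the treatment total minus $1/k$ times the sum of the totals of the blocks containing treatment i, and the intra-block estimate is $kQ_i/(\lambda v)$. The names `PLAN`, `adjusted_totals` and `intra_block_estimates` are illustrative only. The third observation of block 11 is illegible in the source table and is taken here as 1.11, the value that reproduces the tabulated $Q_i$.

```python
# Coded data of Table 2A: one list per block, each entry (treatment, y, x).
# Assumption: the illegible third y-value of block 11 is taken as 1.11.
PLAN = [
    [("a", 1.11, 0.98), ("c", 0.95, 0.66), ("d", 0.82, 0.47)],
    [("b", 1.22, 0.88), ("d", 0.97, 0.58), ("a", 1.70, 1.14)],
    [("c", 1.52, 0.79), ("a", 1.60, 1.08), ("b", 1.11, 0.84)],
    [("d", 1.18, 0.58), ("b", 1.22, 0.88), ("c", 1.54, 0.80)],
    [("a", 1.57, 1.08), ("c", 1.33, 0.79), ("d", 0.93, 0.49)],
    [("b", 1.12, 0.82), ("d", 0.93, 0.51), ("a", 1.69, 1.11)],
    [("c", 1.26, 0.69), ("a", 1.35, 1.00), ("b", 1.06, 0.80)],
    [("d", 1.12, 0.48), ("b", 1.09, 0.79), ("c", 1.29, 0.70)],
    [("a", 1.60, 1.12), ("c", 1.82, 0.82), ("d", 1.09, 0.57)],
    [("b", 1.03, 0.80), ("d", 0.94, 0.52), ("a", 1.66, 1.10)],
    [("c", 1.24, 0.71), ("a", 1.31, 1.00), ("b", 1.11, 0.82)],
    [("d", 0.93, 0.47), ("b", 1.09, 0.78), ("c", 1.27, 0.68)],
]
V, K, LAM = 4, 3, 6  # number of treatments, block size, lambda


def adjusted_totals(plan, col):
    """Q_i: treatment total minus 1/k times the totals of blocks containing i."""
    q = {}
    for block in plan:
        block_total = sum(obs[col] for obs in block)
        for obs in block:
            q[obs[0]] = q.get(obs[0], 0.0) + obs[col] - block_total / K
    return q


def intra_block_estimates(q):
    """Intra-block treatment estimates k * Q_i / (lambda * v)."""
    return {trt: K * value / (LAM * V) for trt, value in q.items()}


Q_Y = adjusted_totals(PLAN, 1)   # col 1 holds y (log noise)
Q_X = adjusted_totals(PLAN, 2)   # col 2 holds x (log resistance)
T_Y = intra_block_estimates(Q_Y)
T_X = intra_block_estimates(Q_X)

if __name__ == "__main__":
    for trt in "abcd":
        print(trt, round(Q_Y[trt], 6), round(T_Y[trt], 6),
              round(Q_X[trt], 6), round(T_X[trt], 6))
```

Because each block contributes $y_{ij} - B_j/k$ to the treatments it contains, the four $Q_i$ sum to zero for either variate, which gives a quick check on the arithmetic.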


The computational procedure for the intra-block analysis of variance
proceeds as in the simple randomized block case.
The appropriate analysis of variance of the sums of squares and cross
products can be summarized as in Table 2C. The treatment mean
square for testing the null hypothesis that all treatment effects are the
same is most conveniently obtained by forming a reduced analysis of
variance, which is summarized in Table 2D.

The analogous results for the inter-block analyses can conveniently
be arranged in a single table. These results are summarized in Table 2E.


TABLE 2C

AUXILIARY INTRA-BLOCK ANALYSIS OF VARIANCE

Source of                              Degrees of     Sum of squares and products
variation                              freedom         y²           xy           x²

Blocks (unadjusted for treatment)         11         0.641 898    0.258 630    0.222 631
Treatments (adjusted for blocks)           3         1.510 637    1.132 506    1.242 508
Error (intra-block)                       21         0.278 896    0.038 661    0.011 025

Total                                     35         2.431 431    1.429 797    1.476 164


The reduced error sums of squares, corrected for the effects of the
concomitant variate, in Tables 2D and 2E are obtained from the formulae

(6.2)   (intra-block)   $S_e = E_{yy} - E_{xy}^2/E_{xx}$ = 0.143 325      d.f. = 20
        (inter-block)   $S'_e = E'_{yy} - (E'_{xy})^2/E'_{xx}$ = 0.087 402      d.f. = 7


Thus the estimates of $B$, for the intra- and inter-block analyses
respectively, result in

(6.3)   $B = E_{xy}/E_{xx}$ = 3.506 667
        $B' = E'_{xy}/E'_{xx}$ = 2.826 268


Hence, using (6.3) the treatment estimates corrected for the concomitant
variate can be calculated. The second and third columns of Table 2F
summarize the corrected treatment estimates for both the intra- and
inter-block analyses. The fourth column gives the combined treatment
estimates (equation 3.16) where the weights $W$ and $W'$ are taken to be

(6.4)   $W$ = 139.542 997
        $W'$ = 80.089 700


TABLE 2F

TREATMENT ESTIMATES CORRECTED FOR THE CONCOMITANT VARIATE

Treatments    (intra-block)    (inter-block)    (combined)     (using 6.8)
    a          -0.692 683       -0.714 910      -0.561 646     -0.503 872
    b          -0.270 246       -0.105 732      -0.241 653     -0.245 296
    c           0.270 054        0.333 138       0.256 232      0.243 122
    d           0.692 871        0.487 502       0.547 068      0.506 045
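
The corrected estimates of Table 2F follow from Table 2B by subtracting the regression coefficient times the estimate for the concomitant variate. The sketch below is only a check on that arithmetic: the names `adjust`, `INTRA` and `INTER` are illustrative, and the inter-block coefficient is taken as 2.826268, the value that reproduces the inter-block column of the table.

```python
# Regression coefficients of (6.3).
B_INTRA = 3.506667
B_INTER = 2.826268  # assumption: value that reproduces Table 2F

# Table 2B: treatment -> (estimate for y, estimate for x).
INTRA = {
    "a": (0.280417, 0.277500),
    "b": (-0.141667, 0.036667),
    "c": (0.131250, -0.039583),
    "d": (-0.270000, -0.274583),
}
INTER = {
    "a": (0.154167, 0.307500),
    "b": (-0.009167, 0.034167),
    "c": (-0.022500, -0.125833),
    "d": (-0.122500, -0.215833),
}


def adjust(estimates, slope):
    """Corrected estimate: t_y minus slope times t_x, per treatment."""
    return {trt: t_y - slope * t_x for trt, (t_y, t_x) in estimates.items()}


corrected_intra = adjust(INTRA, B_INTRA)   # second column of Table 2F
corrected_inter = adjust(INTER, B_INTER)   # third column of Table 2F
tau = adjust(INTRA, B_INTER)               # last column, equation (6.8)

if __name__ == "__main__":
    for trt in "abcd":
        print(trt, round(corrected_intra[trt], 6),
              round(corrected_inter[trt], 6), round(tau[trt], 6))
```

The last line of the computation mixes the two analyses on purpose: intra-block estimates are adjusted with the inter-block coefficient, as equation (6.8) prescribes.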


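The estimated variances for $B$ and $B'$ are

(6.5)   var $B$ = 0.007 166 / 0.011 025 = 0.649 977
        var $B'$ = 0.012 486 / 0.064 490 = 0.193 611

where the numerators are the intra- and inter-block error mean squares
and the denominators are $E_{xx}$ and $E'_{xx}$.
Note that the intra-block estimated variance is over three times larger
than the inter-block estimated variance.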
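
The quantities in (6.2) to (6.5) are tied together by a few lines of arithmetic on the error line of Table 2C. The following sketch assumes the standard covariance formulae (slope $E_{xy}/E_{xx}$, reduced error sum of squares $E_{yy} - E_{xy}^2/E_{xx}$, variance of the slope equal to the error mean square divided by $E_{xx}$). The function name `regression_summary` is hypothetical, and the inter-block $E'_{xx}$ = 0.064490 is the value implied by the printed quotient rather than one read from Table 2E.

```python
def regression_summary(e_yy, e_xy, e_xx, df_error):
    """Slope, reduced error SS, error mean square and slope variance."""
    slope = e_xy / e_xx
    reduced_ss = e_yy - e_xy ** 2 / e_xx
    mean_square = reduced_ss / df_error
    return {
        "B": slope,
        "S_e": reduced_ss,
        "s2": mean_square,
        "var_B": mean_square / e_xx,
        "weight": 1.0 / mean_square,
    }


# Intra-block: error line of Table 2C, 21 - 1 = 20 d.f. after the regression.
INTRA = regression_summary(0.278896, 0.038661, 0.011025, 20)

# Inter-block: only the reduced error SS (6.2) is available here.
INTER_S2 = 0.087402 / 7
INTER_E_XX = 0.064490          # assumption: implied by (6.5)
INTER_VAR_B = INTER_S2 / INTER_E_XX

if __name__ == "__main__":
    print(INTRA)
    print(INTER_S2, INTER_VAR_B, INTRA["var_B"] / INTER_VAR_B)
```

Small differences in the last printed digits against (6.5) come from the paper dividing rounded mean squares.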

Since the variance between two adjusted treatment estimates is
different for each comparison, Finney [12] has suggested using an
average value of $(\hat t_{ix} - \hat t_{jx})^2$. Hence for the intra-block analysis we have


(6.6) average” var (t, — t,) = Ep [ + aaa =| = 0.069 111. 


A similar formula holds for the inter-block analysis 
“average” var — 


_ Ue? | |- 
= | = 0.045 384. 


(6.7) 


However, if the treatments adjusted for the concomitant variate
are estimated using

(6.8)   $\hat\tau_i = \hat t_{iy} - B'\,\hat t_{ix}$

where $B'$ is the inter-block estimate of $B$, then the average variance of a
comparison between two treatments is


2 
(6.9) var (7, — 7,) = E | = 0.017 926. 
Thus, the estimated variance is reduced by almost 3/4 over that of the 
usual intra-block variance. The last column of Table 2F gives the 
treatment estimates using (6.8). 
The two independent tests of significance for the null hypothesis 
that all treatment effects are the same are obtained by testing the 


error mean square against the treatment mean square obtained from 
Tables 2D and 2E, e.g. 


ntra-block af. = 3,90 
0.007 166 
(6.10) 
0.084682, 
(inter-block) F = 0.012 486 ~ 6.78 df. = 3,7 


Both F-ratios in (6.10) are highly significant. Thus, there is no 
real need to combine the two independent tests into a single test. 

Instead the combination of the two independent tests will be illustrated
by testing the interaction between lengths and widths. The
contrast

$\pi = (c - d) - (a - b)$

can be used to test if an interaction exists between lengths and widths.
If $\pi$ is equal to zero, then there is no interaction between lengths and
widths. The variances of the two independent estimates of $\pi$ obtained
from the intra- and inter-block analyses are


(6.11) 


2 
var (1 — (ao + ka) 


Es 
respectively. Thus two independent t-tests can be made for the null
hypothesis that $\pi = 0$ against the alternative hypothesis that $\pi \neq 0$.

When combining tests of significance in cases where the alternative
hypothesis is two sided, a preferred procedure (cf. Fisher [14]) is to
choose a direction as positive, obtain the associated probabilities of
getting the observed value or a larger one, and combine the resulting
probabilities. After combining these probabilities, a two-tailed test
can be used for the final result. This technique will be illustrated here.
The two estimates for $\pi$ and their associated estimated variances are

$\hat\pi$ = -0.000 380,        $\hat\pi'$ = +0.454 814,
var $\hat\pi$ = 0.003 605,     var $\hat\pi'$ = 0.056 451,

which give the t-ratios

$t$ (intra-block) = -0.0063      d.f. = 20
$t'$ (inter-block) = 1.9143      d.f. = 7
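
The two contrast estimates and their t-ratios can be reproduced from the corrected estimates of Table 2F and the variances quoted above. This is a minimal sketch; `interaction_contrast` and `t_ratio` are illustrative names, not functions from any library.

```python
import math

# Corrected treatment estimates from Table 2F.
INTRA = {"a": -0.692683, "b": -0.270246, "c": 0.270054, "d": 0.692871}
INTER = {"a": -0.714910, "b": -0.105732, "c": 0.333138, "d": 0.487502}


def interaction_contrast(est):
    """Length-by-width interaction contrast (c - d) - (a - b)."""
    return (est["c"] - est["d"]) - (est["a"] - est["b"])


def t_ratio(estimate, variance):
    """Estimate divided by its estimated standard error."""
    return estimate / math.sqrt(variance)


pi_intra = interaction_contrast(INTRA)
pi_inter = interaction_contrast(INTER)
t_intra = t_ratio(pi_intra, 0.003605)   # 20 d.f.
t_inter = t_ratio(pi_inter, 0.056451)   # 7 d.f.

if __name__ == "__main__":
    print(round(pi_intra, 6), round(t_intra, 4))
    print(round(pi_inter, 6), round(t_inter, 4))
```

The one-sided probabilities used in the combination would then be upper-tail areas of Student's t with 20 and 7 degrees of freedom; they are not computed here because that needs a t distribution routine outside the standard library.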


Hence, the probabilities of obtaining t-ratios equal to or larger than
those above are

$P$ = 0.50238,
$P'$ = 0.04857.
The value of the weight function 6 is 
1-E: 
6 = .06 


and thus 
C = P,P’ = 0.445. 


Since the alternative hypothesis is actually two-sided, the critical values 
in [23] refer to the .02 and .10 levels of significance. If we adopt (say) 
a level of significance of a = .10, the critical value corresponding to 
= .06is = .047 (obtained by linear interpolation). Thus, one would 
accept the null hypothesis that there is no interaction between lengths 
and widths. 
VARIANCE AND COVARIANCE ANALYSES FOR
UNBALANCED CLASSIFICATIONS*


WALTER T. FEDERER
Cornell University, Ithaca, N.Y., U.S.A. 


INTRODUCTION 


Yates [20, 21], Snedecor [16], Snedecor and Cox [17], Nair [12], 
Cochran [3], Stevens [18], Henderson [10], and Rao [15] have presented 
analysis of variance procedures for unbalanced classifications while 
Day and Fisher [5], Wilks [19], Hazel [9], Rao [15], Henderson [11], 
Das [4], and Federer [6, 7, 8] have discussed covariance analysis for 
unbalanced classifications. Bartlett [1, 2], Quenouille [14], Federer [7] 
and Outhwaite and Rutherford [13], among others, have discussed the 
use of dummy covariates to remove the effect of disproportion in 
particular unbalanced classifications.

The purpose of this paper is to present additional results for variance 
and covariance analyses with unbalanced classifications. In particular, 
the analyses are grouped as follows: 


Case I—Interaction absent;

Case II—Interaction present; the effects assumed to be fixed effects;

Case III—Interaction present; the interaction effects and at least
one of the main effects of the factors represented in the
interactions assumed to be random effects.


A Case I variance analysis is known as “the method of fitting 
constants” and a Case II variance analysis is known as “the weighted 
squares of means analysis” in the literature [16, 17, 20, 21]. A Case III 
analysis of variance has been discussed in the literature for balanced 
classifications [e.g., see 7, ch. VIII], but not for unbalanced classifications. 
A covariance analysis for one-way classifications has been discussed 
by Day and Fisher [5]. Case I covariance analyses for unbalanced 
classifications have been presented by Das [4], Federer [6, 8], and Hazel 
[9]. Federer [6, 8] presented the Case II and the Case III linear covari- 
ance analyses for an unbalanced two-way classification, and included 


*Paper presented at the joint meeting of the Biometric Society, ENAR, the American Statisti- 
cal Association, and the Institute of Mathematical Statistics, Detroit, Sept. 7, 1956. 
illustrative numerical examples. The additional results included in
the present paper represent extensions of previous work [4, 6, 8] to
unbalanced q-way classifications with several covariates.


CASE I 


The linear model for a two-way classification (Table 1) with a 
covariate is: 


$Y_{ijh} = \mu + \tau_i + \rho_j + \epsilon_{ijh} + \beta(X_{ijh} - \bar x)$,   (I-1)


where $\mu$ = an effect common to all observations, $\tau_i$ = an effect common
to the ith level of the first factor or classification, $\rho_j$ = an effect common
to the jth level of the second factor or classification, $\epsilon_{ijh}$ = a random
error component, $\bar x$ = arithmetic mean of the X's, and $\beta$ = the true
regression of Y on X estimated from the residual line of the analysis
of covariance (Table 2). With multiple regression the linear model
becomes:


$Y_{ijh} = \mu + \tau_i + \rho_j + \epsilon_{ijh} + \sum_{g=1}^{b} \beta_g (X_{gijh} - \bar x_g)$,   (I-2)


where $\beta_g$ is the true partial regression of Y on the gth independent
variate, $X_g$, estimated from the residual line of the analysis of covariance
and $\bar x_g$ is the arithmetic mean of the X's in the gth group.

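For the linear model in formula (I-2), the normal equations for
$\hat\mu$, $\hat\tau_i$, $\hat\rho_j$, and $\hat\beta_g$, respectively, are:


$\sum_{i=1}^{v}\sum_{j=1}^{r}\sum_{h=1}^{n_{ij}} Y_{ijh} = Y_{...} = n_{..}\hat\mu + \sum_{i=1}^{v} n_{i.}\hat\tau_i + \sum_{j=1}^{r} n_{.j}\hat\rho_j$;   (I-3)

$\sum_{j=1}^{r}\sum_{h=1}^{n_{ij}} Y_{ijh} = Y_{i..} = n_{i.}(\hat\mu + \hat\tau_i) + \sum_{j=1}^{r} n_{ij}\hat\rho_j + \sum_{g=1}^{b} \hat\beta_g (X_{gi..} - n_{i.}\bar x_g)$;   (I-4)

$\sum_{i=1}^{v}\sum_{h=1}^{n_{ij}} Y_{ijh} = Y_{.j.} = n_{.j}(\hat\mu + \hat\rho_j) + \sum_{i=1}^{v} n_{ij}\hat\tau_i + \sum_{g=1}^{b} \hat\beta_g (X_{g.j.} - n_{.j}\bar x_g)$;   (I-5)

$\sum_{i}\sum_{j}\sum_{h} Y_{ijh}(X_{gijh} - \bar x_g) = \sum_{i=1}^{v} \hat\tau_i (X_{gi..} - n_{i.}\bar x_g) + \sum_{j=1}^{r} \hat\rho_j (X_{g.j.} - n_{.j}\bar x_g) + \sum_{f=1}^{b} \hat\beta_f \sum_{i}\sum_{j}\sum_{h} (X_{fijh} - \bar x_f)(X_{gijh} - \bar x_g)$.   (I-6)

In the above equations

$n_{i.} = \sum_{j=1}^{r} n_{ij}$,   $n_{.j} = \sum_{i=1}^{v} n_{ij}$,   $n_{..} = \sum_{i=1}^{v}\sum_{j=1}^{r} n_{ij}$,

and $n_{ij}$ (= 0, 1, 2, ...) = number of individuals in the ijth subclass.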
The 1 + v + r + b normal equations may be reduced to v + b
equations involving only the unknowns $\hat\tau_i$ and $\hat\beta_g$. Further reduction
of the v + b equations to v equations involving only the $\hat\tau_i$ and the
observations is possible, but not too desirable computationally. The
$\hat\beta_g$ values may be obtained from an analysis of covariance table similar
to Table 2 [see 16, sec. 13.7], and then a set of v equations in $\hat\tau_i$ may
be obtained, the ith equation being:


j=1 


(1-7) 


b 

Xor.. — mites b= Qu. 

The v equations from (I-7) and the equation $\sum_{i=1}^{v} \hat\tau_i = 0$ yield unique
values for the $\hat\tau_i$. With solutions for the $\hat\tau_i$, and with the additional
equation $\sum_{j=1}^{r} \hat\rho_j = 0$, $\hat\mu$ and $\hat\rho_j$ may now be evaluated; the adjusted
means are $\hat\mu + \hat\tau_i$ and $\hat\mu + \hat\rho_j$ for the first and second classifications
respectively.

In the event that formula (I-1) is appropriate, an estimate of $\beta$ is:


Xin — > = Wi. 


W.. 


(1-8) 


Likewise, the kth equation of the v equations in 7, is: 


Limi. ) 
5. 
? zr 


The above v equations plus the equation $\sum_{i=1}^{v} \hat\tau_i = 0$ result in unique
solutions for the $\hat\tau_i$. The estimate of $\beta$ is then obtained from formula
(I-8) or $\hat\beta$ may be obtained directly from Table 2 as $\hat\beta = D_{xy}/D_{xx}$.
If only a variance analysis is to be performed, then each $(X_{gijh} - \bar x_g)$
is set equal to zero in equations (I-1) to (I-7). In obtaining the sums
of squares given in Table 2, analyses of variance are obtained for the
$X_g$ and the $Y$ variates and their cross products. Estimates of $\mu$, $\tau_i$
and $\rho_j$, obtained from equations (I-1) to (I-7) with each $(X_{gijh} - \bar x_g)$
set equal to zero, will be denoted as $\mu'$, $\rho'_j$, and $\tau'_i$. Estimates of $\mu$
and $\rho_j$ obtained from equations (I-2), (I-3), (I-5), and $\sum_{j=1}^{r} \hat\rho_j = 0$
when each $\hat\tau_i$ and each $(X_{gijh} - \bar x_g)$ are set equal to zero will be denoted
as $\mu''$ and $\rho''_j$. Likewise, the estimates of $\mu$ and $\tau_i$ obtained from equations
(I-2), (I-3), (I-4), and $\sum_{i=1}^{v} \hat\tau_i = 0$ when each $\hat\rho_j$ and each
$(X_{gijh} - \bar x_g)$ are set equal to zero will be designated as $\mu^*$ and $\tau^*_i$. The
estimate of $\mu$ obtained from (I-2) and (I-3) when each $\hat\tau_i$, each $\hat\rho_j$,
and each $(X_{gijh} - \bar x_g)$ are set equal to zero is $\mu''' = \bar y$.


TABLE 2

ANALYSIS OF COVARIANCE (LINEAR) FOR A TWO-WAY CLASSIFICATION
WITH UNEQUAL NUMBERS IN THE SUBCLASSES—CASE I.

                                                            Sums of squares and cross products
Source of variation                         D.f.                 y²         xy         x²
Total (corrected for mean)                  n.. - 1              T_yy       T_xy       T_xx
Second factor (ignoring 1st factor)         r - 1                R_yy       R_xy       R_xx
First factor (eliminating 2nd factor)       v - 1                V_yy       V_xy       V_xx
Residual                                    n.. - r - v + 1      D_yy       D_xy       D_xx

Adjusted sums of squares*

                                            D.f.                 S.s.
Residual                                    n.. - r - v          D'_yy = D_yy - D_xy²/D_xx
First factor + residual                     n.. - r - 1          W'_yy = W_yy - W_xy²/W_xx
First factor (eliminating regression
  and second factor)                        v - 1                V'_yy = W'_yy - D'_yy
Second factor + residual                    n.. - v - 1          U'_yy = U_yy - U_xy²/U_xx
Second factor (eliminating regression
  and first factor)                         r - 1                U'_yy - D'_yy

*W_yy = V_yy + D_yy; W_xy = V_xy + D_xy; W_xx = V_xx + D_xx.
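
The adjusted sums of squares in the lower part of Table 2 all follow one pattern: pool an effect line with the residual line, remove the regression from the pooled line, and subtract the adjusted residual. A minimal sketch of that pattern follows; the function names and the numbers in the usage lines are hypothetical and serve only to show the arithmetic.

```python
def adjusted_ss(yy, xy, xx):
    """Sum of squares for y after removing the linear regression on x."""
    return yy - xy ** 2 / xx


def effect_ss_adjusted(effect, residual):
    """Adjusted SS for an effect: (effect + residual)' minus residual'.

    Both arguments are (yy, xy, xx) triples taken from the same table.
    """
    pooled = tuple(e + r for e, r in zip(effect, residual))
    return adjusted_ss(*pooled) - adjusted_ss(*residual)


def residual_slope(residual):
    """Regression coefficient estimated from the residual line."""
    _, xy, xx = residual
    return xy / xx


if __name__ == "__main__":
    # Hypothetical sums of squares and products, for illustration only.
    first_factor = (6.0, 2.0, 4.0)    # V_yy, V_xy, V_xx
    residual = (10.0, 4.0, 8.0)       # D_yy, D_xy, D_xx
    print(residual_slope(residual))                    # beta-hat = D_xy / D_xx
    print(adjusted_ss(*residual))                      # D'_yy
    print(effect_ss_adjusted(first_factor, residual))  # V'_yy
```

The same two functions apply unchanged to the second factor once its sums of products eliminating the first factor are available.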
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In computing the sums of squares for the variance analysis, the 
following procedure is used:* 
Total sum of squares —SS(u’’’) 


SS(u’’, pj’) — SS(u’’’) 
(I-11) 


SS(u*, 7%) — SS(u’’’) 


SS(u’, , — SS(w’’, 


r2 
i=1 


SS(u’, 4) SS(u*, 7*) 


i=l 


i=1 


M 


Total — SS(u’, p! , 77) 


t=1 j=1 =1 j=1 i 

The sums of squares for a classification eliminating the effects of 
regression, the mean, and the other classification are given in Table 2. 
Alternatively, the sum of squares for the first classification adjusted for 
the other effects is = SS(a, , 6; , 8) — SS(u*, , and B*), where 
uw’, p; , and 8° are obtained from the normal equations with each 
set equal to zero. 

The various sums of squares for the X variate are obtained in 4 


*SS(u’’, p’’;) is the sum of squares attributable to uw” and the p’’;; etc. 
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similar manner with yu! , 72; , p!; , u2’ , etc. being the corresponding 
estimates of the effects for the X variate. 

The various sums of cross products for the linear model given by 
formula (I-1) are computed as follows: 


j=1 h=1 


X,Y, 

i=l n;. n.. 

i=l 

= _ Ent.) (I-19) 


+ Zk... — 


The above formulae for sums of products are applicable for the gth 
X variate in a multiple covariance set-up. 

If the experiment is designed as a three-way classification with co- 
variates the linear model is:* 


$Y_{ijhs} = \mu + \alpha_i + \gamma_j + \delta_h + \epsilon_{ijhs} + \sum_{g=1}^{b} \beta_g (X_{gijhs} - \bar x_g)$;   (I-21)


where $\mu$ = a common mean effect, $\alpha_i$ = effect common to the ith level of
the first or A classification, $\gamma_j$ = effect common to the jth level of the second
or C classification, $\delta_h$ = effect common to the hth level of the third
or D classification, $\epsilon_{ijhs}$ = a random error effect, and $\beta_g$ = true partial
regression of Y on $X_g$ estimated from the residual line in the analysis
of covariance. There are no interaction terms for a Case I analysis.
The normal equations for the various effects ($\hat\mu$, $\hat\alpha_i$, $\hat\gamma_j$, $\hat\delta_h$, and $\hat\beta_g$,
respectively) are:


*This model holds for either a complete or incomplete factorial arrangement of the combina- 
tions. For an incomplete factorial arrangement (e.g. the latin square) care must be exercised in summing 
over the various subscripts. 
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a c d 
n + + = Y.... ; (1-22) 
i=1 7=1 h=1 
7 h 
(I-23) 


+ — m8) = 5 
(I-24) 
+ é,) + > Ni + 
' (1-25) 
A(X... — + — n.;.£,) 
+ > 6,(X,. 


i 2 =1 


The above equations plus the equations 


a c d 
ij=1 h=1 

yield unique estimates of the effects. 

The form for the linear covariance analysis is indicated in Table 3.
In general, the various sums of squares for the variance analysis are
obtained as indicated in Table 3. The procedure is easily generalized
for a q-way classification with b covariates. The algebra and the arithmetic
become more difficult but the principles are the same.

In order to obtain a test of significance for two treatment means, it is
necessary to compute the variances for the two means considered.
Although the variances and covariances of the $Q_{i.}$ values in (I-7) are
known [12], it is rather cumbersome to compute the variance of a
difference for two effects, say $\hat\tau_i$ and $\hat\tau_{i'}$, but if this is desired, Rao [15]
has described the general procedure. Also, one could obtain an average
coefficient for the variance of a treatment mean and then use the resulting
average standard error of a mean in one of the multiple range tests
[see 7, ch. II].
TABLE 3

ANALYSIS OF COVARIANCE (LINEAR) FOR A THREE-WAY CLASSIFICATION
WITH UNEQUAL NUMBERS IN THE SUBCLASSES—CASE I*

                                                          Sums of products
Source of variation                        D.f.              y²        xy        x²

Total (eliminating mean)                   n... - 1          T_yy      T_xy      T_xx
A (ignoring C and D; elim. mean)           a - 1             U_yy      U_xy      U_xx
C (ign. D; elim. A and mean)               c - 1             V_yy      V_xy      V_xx
D (elim. A, C, and mean)                   d - 1             D_yy      D_xy      D_xx
Residual (by subtraction)                  f_r               R_yy      R_xy      R_xx
C (elim. A, D, and mean)                   c - 1             C_yy      C_xy      C_xx
A (elim. C, D, and mean)                   a - 1             A_yy      A_xy      A_xx

Residual (elim. regression,
  A, C, D, and mean)                       f_r - 1           R'_yy = R_yy - R_xy²/R_xx
Residual + D                               f_r + d - 2       S'_yy = R_yy + D_yy - (R_xy + D_xy)²/(R_xx + D_xx)
D (elim. regression, A, C, and mean)       d - 1             D'_yy = S'_yy - R'_yy
C + residual                               f_r + c - 2       W'_yy = R_yy + C_yy - (R_xy + C_xy)²/(R_xx + C_xx)
C (elim. regression, A, D, and mean)       c - 1             C'_yy = W'_yy - R'_yy
A + residual                               f_r + a - 2       Z'_yy = A_yy + R_yy - (A_xy + R_xy)²/(A_xx + R_xx)
A (elim. regression, C, D, and mean)       a - 1             A'_yy = Z'_yy - R'_yy


Tw = Vas — SS”); 


w = SS(u-, ai, — SS(u*, af); 


Di, = SS(A, ai, Ti» = SS(u”, a@ 


>Bo); 


Uy = SS(u* af) — SS(u'"); 
Dyy SS(u’, ai, vj, dn) SS(u-, a, 


CASE II 


When interaction is present in a two-way classification with b
covariates, the linear model is:

$Y_{ijh} = \mu + \tau_i + \rho_j + \rho\tau_{ij} + \epsilon_{ijh} + \sum_{g=1}^{b} \beta_g (X_{gijh} - \bar x_g)$,   (II-1)


where $\mu$, $\tau_i$, $\rho_j$, and $\beta_g$ are defined in equation (I-2) and where $\rho\tau_{ij}$ =
an interaction effect common to the ijth combination of the two categories.
The effects are considered to be fixed effects and $n_{ij} = 1, 2, \cdots$
(i.e., $n_{ij} > 0$).

The normal equations for A, 7; , 6; , p7:; , and 8, , respectively, are: 


ni (a+ #) + Do + 
(II-3) 
+ n;.&,) = ; 
(II-4) 


The above 1 + v + r + rv + b equations plus the following result in 
unique solutions for the , 7; , p7:; , and 8, : 


(11-4) 


= 2 = 2, = 0. (11-7) 
i=l j=1 i=l 


If b = 1, then an estimate of 8 = 8, is obtained from the within 
subclasses line in the analysis of covariance (Table 4) as: 


{3 — = §., 
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If b > 1 then the various 8, may be obtained from the following b 
equations [see 16, sec. 13.7]: 


i h (II-9) 


forg = 1,2, --- , 


With the above estimates for 8, , the remaining effects are estimated 
as follows: 


= Gu. — — #,)); (II-11) 


= Hii. — 6; — — — (11-18) 


If one or more of the $n_{ij} = 0$, no analysis is possible unless a zero (or
some other constant value) is inserted for the corresponding $\hat{\rho\tau}_{ij}$;
this results in the following changes for formula (II-7) [8, 10]:


ré 
= Dy = 0; (11-14) 
t=1 i=1 


These changes result in a biased analysis. The size of the bias depends
upon the actual values of $\rho\tau_{ij}$ which are estimated as zero and upon
the number of $n_{ij}$ equal to zero in the analysis. When some $n_{ij} = 0$,
possible recourses are to use the biased Case II analysis, to use a Case I
analysis assuming no interaction, or to use an among subclasses and
within subclasses analysis.

Before computing the various sums of products it should be noted
that there may be no interest in discussing the category means when
interaction is assumed present. If there is interest in the category
means then weighted means may be more appropriate than the un- 
weighted means. If an among and a within groups variance or co- 
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variance analysis is suitable (i.e., the rv or r = 2 r, = Do, 0; =0 
subclass means are to be compared), then sie is little difficulty in 
computing the among and within subclasses analysis and in comparing 
the subclass means, j,;. , either by an F test or a multiple range test [see 
7, ch. 

If tests of significance are required for category means or if components
of variance are to be estimated by Henderson's Method 3 [11],
the various sums of products in Table 4 need to be computed. The
first three rows of sums of products in Table 4 are obtained from a
Case I analysis (i.e., formulae (I-10), (I-11), (I-13), (I-16), (I-17),
and (I-19)). The within subclasses sums of products are obtained in
the usual manner [see formula (II-8)]. The interaction sums of products
are computed as:


72 


1 


II 


X oii. Vij. 


where the estimates $\mu'$, $\rho'_j$, and $\tau'_i$ are defined in the paragraph preceding
formula (I-10) and where $\hat\mu$, $\hat\rho_j$, $\hat\tau_i$, and $\hat{\rho\tau}_{ij}$ are the estimates obtained
from formulae (II-1) to (II-7) when each $(X_{gijh} - \bar x_g)$ is set equal to
zero.

The sums of products for the first factor (eliminating mean, second 
factor and interaction) are [10, 21]: 


*In a multiple range test, the standard error of a mean used to compute a significant range for 
comparing two means, say ji.. and g.. with ni. and nz. observations, respectively, is (ID. B. Duncan, 


written correspondence) : 
8, fl ( 1 1 )} 


In a covariance analysis, the average variance of a difference for A category means, say i and,’ 
is approximately equal to 
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Aw SSG, ’ ’ pT, ;) = SS(u", p; 
(II-18) 

on. \2 
Gi. = (o> w;.Gi..) 


wi. 


where u”, p; , p7,; are the estimates obtained from the normal equations 


when each 7, and each (X,;;, 


— £,) is set equal to zero; 


A, = 2. Wi (11-19) 
An = Dw#.. - (11-20) 


TABLE 4 


ANALYSIS OF COVARIANCE (LINEAR) For A Two-way CLassIFICATION WITH 
UNEQUAL NUMBERS IN THE SUBCLASSES——CaskE IT 


(First Facror = A; Seconp Facror = B). 


Sums of products 
Source of variation Ds. y? ry x? 
Total (eliminating mean) n,.— 1 T yy Tis 
B (elim. mean; ign. A and 
A X B) r—1 Rey 
A (elim. B and mean; ign. 
A X B) v—1 Vex Vey Vex 
A X B (elim. A, B, and mean)} (r — 1)(v — 1) re Tey | 
Within subclasses = S n.. — rv Say Sie 
B (elim. A, A X B, and mean) r—1 Bss Bey Bez 
A (elim. B, A X B, and mean) v—-1 Avy Agy As 
8 (adj. for regression) = — / See 
S+B = Byy + Syy 
+r—2 (Bry + Szy)?/ (Bez + S:z) 
B (adj. for other effects) r-—1 By = U,, — Si, 
S+A — rv = Ay 
A (adj. for other effects) = Way — 
S+AXB = Tyy + Syy 
= (Tey + Sxy)?/ Szz) 
A X B (adj. for other effects) | (r —I)(’—1)| Ty = Ziy — Sty 
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In the above, 


-1 
and = (4 +) 


| 


j=1 i=1 


Due to a theorem given by Yates [22], the F or z test of significance is 


appropriate for comparing the two mean squares A,,/(v — 1) and 
S,,/(n.. — rv). Likewise, 
@ 


is a valid test of the null hypothesis of zero effects in a covariance 
analysis. 


Similarly, 
B., i > > (II-21) 
where 


For a three-way classification with b covariates and interaction effects 
not assumed to be zero as in (I-21), the linear model is: 


Y & + aby + + 


+ + BAX giins #,), 


where uw, , ¥; , and 8, are defined in (I-21) and where the 
remaining terms represent interaction effects (fixed) between the three 
classifications. Here, »;;, = number of observations for the zjhth 
combination of the three factors must be greater than zero. If any 
n;;, = 0, then the alternatives described for the preceding unbalanced 
two-way classification may be used. The normal equations for the 
various effects are set up in the usual manner; e.g., the normal equation 
for &; is: 


n;. (a + &) + + + 
The sums of squares for the various effects except the mean are
presented in Table 5. The method of computation for sums of squares
which add to the total is given in the upper part of the table. The
resulting mean squares from the next to the last line in the upper part
and all those from the lower part of Table 5 are used in making tests of
significance of the null hypotheses for each of the main effects and
interactions. The within subclasses mean square is the error mean
square for all F tests.

Covariance analyses are carried out in the same manner as for the
two-way classification. For example, the sum of cross products for
A×C (eliminating all other effects) is:


where the estimates with the superscript ''' are obtained from the
remaining normal equations with each $\hat{\alpha\gamma}_{ij}$ and each $(X_{gijhs} - \bar x_g)$
set equal to zero.

With b covariates, solutions for $\hat\beta_g$ may be obtained from formulae
similar to (II-9), i.e., the within subclasses sums of squares and cross
products for a three (or higher)-way classification. Likewise, a direct
extension of the results in Tables 4 and 5 results in a variance or covariance
analysis for a q-way classification.

CASE III 

If one or both categories in a two-way classification are considered 
to be a sample of levels or treatments from a large population of levels 
or treatments for the given category, the interaction mean square is 
used to test hypotheses about mean effects and to adjust the means for 

variation due to a covariate(s). Also, the interaction effects, as such, 
may not be of much interest except in estimating the component of 
variance associated with interaction effects. Since a weighted estimate 
of the effect would be preferable, for statistical reasons, to an unweighted 
estimate, the problem of estimating effects is complicated because the 
weights are unknown. If the weights are estimated from the data, 
further statistical problems arise. In certain experiments, the experi- 
menter may have little or no control over the number of observations in 
each subclass and it may be realistic to assume that the linear model is of 
the following form: 
$Y_{ijh} = \mu + \tau_i + \rho_j + \rho\tau_{ij} + \epsilon_{ijh} + \beta_1(\bar x_{ij.} - \bar x) + \beta(X_{ijh} - \bar x_{ij.})$,   (III-1)

where $\mu$, $\tau_i$, $\rho_j$, $\epsilon_{ijh}$, and $X_{ijh}$ are defined in (II-1) except that $\tau_i$
and/or $\rho_j$ are considered to be random effects, $\bar x_{ij.} = X_{ij.}/n_{ij}$,
$\rho\tau_{ij}$ = an interaction effect of the ijth combination of the two categories
and is a random effect, $\beta_1$ = regression coefficient from the interaction
line in the covariance analysis, and $\beta$ = a within subclasses regression
coefficient; if $\beta_1 = \beta$ then (III-1) reduces to (II-1) except for the random
effects set-up for (III-1) as opposed to the fixed effects assumption in
(II-1). For such situations, a Case III analysis may be desired and
the weights may have to be estimated from data in the present or in a
past experiment. The weights themselves will have a sampling distribution
but the experiment would usually be analyzed assuming that
the estimated weights were the true weights.

Two sums of squares could be minimized to obtain least squares 
estimates of the effects. These are: 


r nid 

and 


$\sum_{i=1}^{v}\sum_{j=1}^{r} w_{ij}\,(\bar y_{ij.} - \mu - \tau_i - \rho_j - \beta_1(\bar x_{ij.} - \bar x))^2$,   (III-3)

where $\bar y_{ij.}$ and $\bar x_{ij.}$ = subclass means for the two variates. If $w_{ij} = n_{ij}$
in (III-3) where $n_{ij}$ = number of observations in the ijth subclass,
minimization of (III-3) results in the same estimates of $\mu$, $\tau_i$, $\rho_j$,
and $\beta_1$ obtained from minimizing (III-2). Since this is true, since $w_{ij}$
is not always equal to $n_{ij}$, and since $\beta$ is estimated from the within
subclasses sums of products according to formula (II-8), the sum of
squares in (III-3) is minimized instead of the one in (III-2).
The problem of correct weighting is relatively simple if the true
weights are known. Thus, weighting inversely to the variance of $\bar y_{ij.}$
results in:

$w_{ij} = \dfrac{1}{\sigma_{\rho\tau}^2 + \sigma_e^2/n_{ij}} = \dfrac{n_{ij}}{n_{ij}\sigma_{\rho\tau}^2 + \sigma_e^2}$,   (III-4)

where $\sigma_{\rho\tau}^2$ and $\sigma_e^2$ are the variance components associated with interaction
and within subclasses, respectively. In practice $\sigma_e^2$ and $\sigma_{\rho\tau}^2$ are usually
unknown. First, consider the two limiting situations:

(i) $\sigma_{\rho\tau}^2$ is large relative to $\sigma_e^2/n_{ij}$;

(ii) $\sigma_{\rho\tau}^2$ is small relative to $\sigma_e^2/n_{ij}$.
In situation (i), one may set $w_{ij} = 1$ for all practical purposes; i.e., an
analysis of covariance is performed on the subclass means. In the second
situation, $w_{ij}$ is set equal to $n_{ij}$, and the analysis goes through as described
below using the weights $w_{ij}$ [see 8].
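
The behaviour of the weights in the two limiting situations can be seen numerically. The sketch below assumes weights inversely proportional to the variance of a subclass mean, that is $n_{ij}/(n_{ij}\sigma_{\rho\tau}^2 + \sigma_e^2)$; the function name `subclass_weight`, its arguments and the variance components used in the example are hypothetical.

```python
def subclass_weight(n_ij, var_interaction, var_within):
    """Weight for a subclass mean: inverse of var_interaction + var_within / n_ij.

    An empty subclass (n_ij == 0) is given weight zero.
    """
    if n_ij == 0:
        return 0.0
    return n_ij / (n_ij * var_interaction + var_within)


if __name__ == "__main__":
    counts = [1, 2, 5, 10]

    # Situation (i): interaction component dominates, weights nearly equal.
    w_i = [subclass_weight(n, 100.0, 1.0) for n in counts]
    print([round(w / w_i[-1], 3) for w in w_i])

    # Situation (ii): interaction component negligible, weights follow n_ij.
    w_ii = [subclass_weight(n, 0.0, 1.0) for n in counts]
    print(w_ii)
```

Only the relative sizes of the weights matter for the estimates, which is why situation (i) can use unit weights and situation (ii) can use the subclass numbers themselves.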

The true situation is usually in between the above two limiting
situations, and in order to perform an analysis on the data it will be
necessary to have reasonably good estimates of $\sigma_{\rho\tau}^2$ and $\sigma_e^2$. In many
situations the variance (or covariance) components will need
to be estimated from the data themselves. One method for doing this is
illustrated by Federer [8], where Henderson's [11] Method 1 was used.
This method assumes that both effects are random effects; if one of
the effects is fixed then a bias results [11]. Recourse to Henderson's
[11] Methods 2 or 3 may be made if one of the effects is considered to be
fixed.

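The normal equations for $\hat{\mu}$, $\hat{\tau}_i$, $\hat{\rho}_j$, and $\hat{\beta}_1$, respectively, are:

$$w_{\cdot\cdot}\hat{\mu} + \sum_{i=1}^{v} w_{i\cdot}\hat{\tau}_i + \sum_{j=1}^{r} w_{\cdot j}\hat{\rho}_j + \hat{\beta}_1\sum_{i=1}^{v}\sum_{j=1}^{r} w_{ij}(\bar{x}_{ij\cdot} - \bar{x}) = \sum_{i=1}^{v}\sum_{j=1}^{r} w_{ij}\bar{y}_{ij\cdot}, \qquad \text{(III-5)}$$

$$w_{i\cdot}(\hat{\mu} + \hat{\tau}_i) + \sum_{j=1}^{r} w_{ij}\{\hat{\rho}_j + \hat{\beta}_1(\bar{x}_{ij\cdot} - \bar{x})\} = \sum_{j=1}^{r} w_{ij}\bar{y}_{ij\cdot}, \qquad \text{(III-6)}$$

$$w_{\cdot j}(\hat{\mu} + \hat{\rho}_j) + \sum_{i=1}^{v} w_{ij}\{\hat{\tau}_i + \hat{\beta}_1(\bar{x}_{ij\cdot} - \bar{x})\} = \sum_{i=1}^{v} w_{ij}\bar{y}_{ij\cdot}, \qquad \text{(III-7)}$$

$$\hat{\mu}\sum_{i=1}^{v}\sum_{j=1}^{r} w_{ij}(\bar{x}_{ij\cdot} - \bar{x}) + \sum_{i=1}^{v}\hat{\tau}_i\sum_{j=1}^{r} w_{ij}(\bar{x}_{ij\cdot} - \bar{x}) + \sum_{j=1}^{r}\hat{\rho}_j\sum_{i=1}^{v} w_{ij}(\bar{x}_{ij\cdot} - \bar{x}) + \hat{\beta}_1\sum_{i=1}^{v}\sum_{j=1}^{r} w_{ij}(\bar{x}_{ij\cdot} - \bar{x})^2 = \sum_{i=1}^{v}\sum_{j=1}^{r} w_{ij}\bar{y}_{ij\cdot}(\bar{x}_{ij\cdot} - \bar{x}). \qquad \text{(III-8)}$$

In the above, $w_{i\cdot} = \sum_{j=1}^{r} w_{ij}$, $w_{\cdot j} = \sum_{i=1}^{v} w_{ij}$, $w_{\cdot\cdot} = \sum_{i=1}^{v}\sum_{j=1}^{r} w_{ij}$, and
$w_{ij}$ = weight for the $ij$th subclass mean; $w_{ij}$ could be zero and the analysis
could still be performed. That is, zero observations could be obtained
for certain subclasses. If this were a random event, the interaction
effect for that subclass would be set equal to its expected value, zero.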

Substituting for $\hat{\mu}$, $\hat{\rho}_j$, and $\hat{\beta}_1$ results in $v$ equations in the $\hat{\tau}_i$, the
kth equation being:


r r Wi: 2 


j=l j=l g=1 WG 
r 
Wy; W;; 
A 
W.; 
#k 
1 - X 
Vex i=1 i=l i 
4 
Wi; 
xX W; 5X5; W ) 
i=l W,; i=1 i=1 
Tr 
i= j=1 
G 
7=1 W,; t=1 
where 
t=1 j=1 i= i=1 
and 


t=1 j=1 i=1 
Likewise, the k-th equation of the $v$ equations in the $\hat{\tau}_i$ in terms of $\hat{\beta}_1$
and the observations is:


r 


= = = 
7=1 7=1 i=1 (1II-12) 
r r w 
A ki 
j=1 j=1 W.; 


The $v$ equations in (III-12) plus the equation $\sum_{i=1}^{v}\hat{\tau}_i = 0$ result in a
solution for the $\hat{\tau}_i$. If a variance analysis is desired each $\bar{x}_{ij\cdot}$ in (III-12)
is set equal to zero and the resulting estimates $\tau_i'$ of the $\tau_i$ are obtained.
$\hat{\beta}_1$ in the covariance analysis is the interaction sum of cross products
divided by the interaction sum of squares for the covariate, i.e. $\hat{\beta}_1 = E_{xy}/E_{xx}$ (Table 6).

The sums of products in the first three rows of Table 6 are obtained
in the usual manner [see formulae (I-10), (I-16), and (II-8)]. The
remaining sums of the cross products are given below:
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Ww. 


> i i 


R., = 
(111-14) 
+ wikis.) + OO wikis) 


Il 


where $\mu'$, $\rho_j'$, and $\tau_i'$ are the estimates obtained from the normal equations
with each $(\bar{x}_{ij\cdot} - \bar{x})$ set equal to zero; the $\mu_x'$, $\rho_{xj}'$, and $\tau_{xi}'$ are similar
estimates obtained for the X variate;


oD wits.) + wees.) 


The procedure for obtaining the sums of squares eliminating regression 
is indicated in Table 6. 

The procedure for b covariates is a straightforward extension of the
results for a Case II analysis. The normal equations given by formulae
(III-5) to (III-8) would be altered for the terms involving $\hat{\beta}_1$ and the
$\bar{x}_{ij\cdot}$’s. Instead of the term $\beta_1(\bar{x}_{ij\cdot} - \bar{x})$ substitute $\sum_{h=1}^{b}\beta_{1h}(\bar{x}_{hij\cdot} - \bar{x}_h)$
in (III-1) and make the corresponding changes in formulae (III-5)
to (III-12). The $\hat{\beta}_{1h}$ would be obtained from the Error (interaction)
line in the multiple covariance analysis as described for a Case II
analysis.

For a three-way classification and for the random effects situation for 
all three categories, additional statistical problems arise in testing 

TABLE 6


COVARIANCE (LINEAR) ANALYSIS FOR AN UNBALANCED TWO-WAY
CLASSIFICATION, CASE III.


Sums of products 


Source of variation D.f. y? xy x 


Total (elim. mean 


only) n,.—1 Ty 
Within subclasses n..— Szz 
Among subclasses r,—1 by subtraction 


Among weighted sub- 


class totals r,—1 Wor Wry Wes 
B (elim. mean; ign. A)} r— 1 Rey 
A (elim. mean and B) 

=A v—1 Agy Ags 
Error (interaction) 

=E fe Eyy Ex 
B (elim. mean and A) 

=B r—1 Bo By Bez 

—| 

E (elim. A, B, mean, 

and regression) fe—1 = Ey, — 
A E fe 2 Usy Ay, Ey, Exy)?/(Azz Ezz) 
A (elim. B, mean, 

and regression) = Usy — Evy 
B + E fe + B yy + + Exy)?/(Bzz + Ez) 
B (elim. A, mean, 

and regression) r—1 Biy = Vey — Evy 


hypotheses about main effects. For example, consider that the linear 


model is: 

+ — 2.4. +2) (111-18) 
+ + 28,;... + + 22... 


? 
where the effects are defined in (II-22) except that all effects are random
effects, and $\beta_1$, $\beta_2$, $\beta_3$ and $\beta_4$ are different regression coefficients from the
various interaction lines in the analysis of covariance. In such a model,
there is no single line in the analysis of covariance that is suitable for
testing hypotheses about main effects and for adjusting for variation
in the covariate. A simplifying assumption would be to assume
$\beta_1 = \beta_2 = \beta_3 = \beta_4$ and possibly that $\beta_1 = \beta$ also. This would simplify
the covariance problem but would not simplify the hypothesis-testing
problem [7, ch. VIII]. A simplification in hypothesis testing would be
possible if one or more of the two-factor interactions could be assumed
relatively small, and then there would be a single interaction mean
square which could be used to test hypotheses about the main effects
concerned.

The normal equations and the Case III covariance analysis for the
unbalanced three-way classification represent a straightforward extension
of the Case II analysis for a three-way classification and of a
Case III analysis for a two-way classification. Solutions for the various
effects are obtained as before.

Estimation of variance components from a given set of data, either
the present or a past experiment, presents some problems even for an
unbalanced two-way classification. As a first approximation in an
experiment one could use Henderson’s [11] Method 1 assuming both
effects random and assuming $w_{ij}$ fixed [see 8]. New weights are computed
from each analysis and used in the subsequent analysis, and
the process is repeated until the weights stabilize.

Alternatively, one could use Henderson’s [11] Method 3 and obtain
the expectation of $I'_{yy}$ in Table 4 as a first estimate for $\sigma^2_{\rho\tau}$. $\sigma^2_{\epsilon}$ is estimated
from the within subclasses mean squares as $\hat{\sigma}^2_{\epsilon} = S'_{yy}/(n_{\cdot\cdot} - rv - 1)$.
The analysis of covariance in Table 6 is obtained. As a second estimate
of $\sigma^2_{\rho\tau}$ one could obtain the expected value of $E'_{yy}$ in Table 6 for fixed $w_{ij}$.
Then, Table 6 could be recomputed using the second estimate of $\sigma^2_{\rho\tau}$
and $\hat{\sigma}^2_{\epsilon}$, and a third estimate of $\sigma^2_{\rho\tau}$
obtained. The process could be repeated until estimates of $\sigma^2_{\rho\tau}$ stabilize.

For a three-way or higher-way classification Henderson’s [11]
Method 1 probably should be used to obtain the estimated variance 
components if the degrees of freedom are fairly large, say greater than 
20 to 30, for each sum of squares considered. Also, the three-way 
classification could be collapsed into a two-way classification for certain 
experimental situations in order to simplify the analysis. In still other 
situations, a Case II analysis might be necessary for two factors whereas 
the third factor and interactions with the third factor would involve a 
Case III analysis. 

NUMERICAL EXAMPLES*


An experiment was conducted on the effect of different humidity
treatments on roses planted at different times (6/15, 7/15, 8/15, and 9/15),
with observations taken over a period of months. One characteristic measured was the
number of salable roses per month. The data for Y = number of
salable roses in Tables 7 and 9 represent a selected sample of data from
the total experiment. The X covariate represents the position on the
greenhouse bench and is used to control variation within a replicate
[see 7, sec. XVI-11]. The two replicates were on opposite greenhouse
benches. For the data in Table 7, the positions were grouped into 5
sets of 3 each and given the numbers 1 to 5. A Case I analysis of covariance
for these data and the treatment means adjusted for regression
are presented in Table 8. Since this is a 2 × 5 table, Snedecor’s [16,
sec. 11.11] analysis of variance procedure may be followed for the Y
and X variates, and for the cross products, or the entire analysis follows
from a direct application of the formulae given for a Case I analysis.
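
The adjustment for regression in the lower half of Table 8 can be checked with a short Python sketch. The sums of products are those read from Table 8; the function name `adjust_for_regression` and the final variance ratio are illustrative additions and are not quoted in the source.

```python
def adjust_for_regression(syy, sxy, sxx):
    """Sum of squares of y after removing the linear regression on x."""
    return syy - sxy ** 2 / sxx


# Sums of products (y^2, xy, x^2) as read from Table 8.
treatment = (912.70, -47.667, 22.889)   # treatment (elim. mean and rep.), 4 d.f.
residual = (658.13, 27.334, 9.333)      # residual, 9 d.f.

# Residual line adjusted for regression; the slope uses one degree of freedom.
residual_adj = adjust_for_regression(*residual)
residual_df = 9 - 1

# Treatment + residual line, adjusted in the same way.
pooled = tuple(t + r for t, r in zip(treatment, residual))
pooled_adj = adjust_for_regression(*pooled)

# The adjusted treatment sum of squares is obtained by difference.
treatment_adj = pooled_adj - residual_adj
treatment_df = 4

slope = residual[1] / residual[2]
f_ratio = (treatment_adj / treatment_df) / (residual_adj / residual_df)

print(round(residual_adj, 2), round(pooled_adj, 2), round(treatment_adj, 2))
print(round(slope, 4), round(f_ratio, 2))
```

The three adjusted sums of squares agree with the entries 578.08, 1558.00 and 979.92 of Table 8 to rounding.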


TABLE 7


NUMBER OF SALABLE FLOWERS (ROSES), $Y_{ijk}$, OPENED IN ONE MONTH
(DECEMBER) AND LOCATION ON GREENHOUSE BENCH, $X_{ijk}$, 6/15 PLANTING


                     Rep. I                      Rep. II                      Total
Treatment     n     Y    X     Y    X      n     Y    X     Y    X      n    Y_i..   X_i..
    1         2    27    5    18    5      1    19    4     -    -      3      64      14
    2         1    31    3     -    -      2    52    5    57    5      3     140      13
    3         1    34    2     -    -      2    52    1    33    1      3     119       4
    4         1    38    4     -    -      2    60    2    45    2      3     143       8
    5         1    31    1     -    -      2    50    3    40    3      3     121       7
  Total       6    Y.1. = 179; X.1. = 20   9    Y.2. = 408; X.2. = 26  15     587      46


The data in Table 9 are the number of salable roses obtained for the 
month of April from the 7/15 planting, Y, and the position on the 
greenhouse bench, X. The 15 positions were not grouped as they were 
in Table 7, but represent the location of the treatment in one of the 
15 possible positions. It is assumed, as before, that there was a linear 


*The data for this example were obtained through the courtesy of Dr. R. W. Langhans, Dept. of
Floriculture, Cornell University, from an experiment reported in his doctoral dissertation.

TABLE 8


CASE I COVARIANCE ANALYSIS FOR DATA IN TABLE 7.


                                                    Sums of products
Source of variation                      D.f.      y^2          xy         x^2
Total (uncorrected)                       15     25407        1755        174
Correction for mean                        1     22971.27     1800.133    141.067

Total                                     14      2435.73      -45.133     32.933
Replicate (ign. treat.; elim. mean)        1       864.90      -24.800      0.711
Treatment (elim. mean and rep.)            4       912.70      -47.667     22.889
Residual                                   9       658.13       27.334      9.333

                                         D.f.    Adj. sum of squares    Mean square
Residual (elim. reg., treat., rep.
  and mean)                                8          578.08               72.26
Treat. + residual                         12         1558.00                 -
Treatment (elim. reg., rep., and
  mean)                                    4          979.92              244.98


$\hat{\mu} + \hat{\tau}_1 = 18.55$     $\hat{\mu} + \hat{\tau}_3 = 42.85$     $\hat{\mu} + \hat{\tau}_5 = 40.58$
$\hat{\mu} + \hat{\tau}_2 = 41.06$     $\hat{\mu} + \hat{\tau}_4 = 46.94$     $\hat{\rho}_1 = -5.7 = -\hat{\rho}_2$



gradient from one end of the bench to the other. The Case II covariance 
analysis for the data of Table 9 is presented in Table 10, and is obtained 
by a direct application of the formulae given for a Case II analysis. 
The adjusted treatment means are 80.78, 74.15, 81.02, 63.78 and 61.78 
for treatments 1, 2, 3, 4, and 5 respectively. 


TABLE 9


NUMBER OF SALABLE FLOWERS (ROSES), $Y_{ijk}$, OPENED IN ONE MONTH
(APRIL) AND LOCATION ON GREENHOUSE BENCH, $X_{ijk}$, 7/15 PLANTING.


                     Rep. I                      Rep. II                      Total
Treatment     n     Y    X     Y    X      n     Y    X     Y    X      n    Y_i..   X_i..
    1         1   102   15     -    -      2    71   10    79   11      3     252      36
    2         2    84    9    81    7      1    76   14     -    -      3     241      30
    3         2    67    5    83    4      1    74    2     -    -      3     224      11
    4         1    71   11     -    -      2    51    4    63    5      3     185      20
    5         1    53    2     -    -      2    63    8    61    7      3     177      17
  Total       7    Y.1. = 541; X.1. = 53   8    Y.2. = 538; X.2. = 61  15    1079     114

TABLE 10


CASE II ANALYSIS OF DATA IN TABLE 9.


                                                    Sums of products
Source of variation                      D.f.      y^2          xy         x^2
Total (elim. mean)                        14      2426.93      447.60     229.60
Replicate (ign. tr. and inter.;
  elim. mean)                              1       376.00       -2.01       0.01
Treatment (elim. rep. and mean;
  ign. inter.)                             4      1320.97      303.81     136.12
I = Interaction (elim. tr., rep.,
  and mean)                                4       491.46      139.80      89.47
S = Within subclasses                      5       238.50        6.00       4.00
T = Treatment (elim. rep., inter.,
  and mean)                                4      1606.67      367.33     172.53


                                         D.f.    Adj. sum of squares    Mean square
S (adjusted for regression)                4          229.50               57.38
I (adjusted for regression)                4          273.03               68.26
S + T (adjusted for regression)            8         1055.64
T (adjusted for regression)                4          826.14              206.54

The data from Table 9 are used to illustrate a Case III linear covariance
analysis. The computational form for the analysis is given in
Table 11. The first set of weights is obtained from the variance
components estimated from the data (Tables 4 and 10) [see 8]:

$$\hat{\sigma}^2_{\epsilon} = 57.38;$$

$$\hat{\sigma}^2_{\rho\tau} = \frac{1}{1.014}\{68.26 - 57.38\} = 10.73,$$


1 (+ -1+-4) 


where 
Since $n_{ij}$ = 1 or 2 the two weights are (formula (III-4))

$$w_{11} = \frac{1}{10.73 + 57.38} = .015$$

and

$$w_{12} = \frac{2}{2(10.73) + 57.38} = .025.$$

Since there are only two weights, the coded weights $w_{11} = 3$ and $w_{12} = 5$
will be used to simplify the computations. The actual weights may be
simpler to use than coded weights in examples where the $n_{ij}$ vary
considerably.
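
The arithmetic behind the first set of weights can be followed in the sketch below. The two mean squares and the divisor 1.014 are the values quoted in the text; the variable and function names are illustrative assumptions.

```python
# Adjusted mean squares from Table 10 and the divisor quoted in the text.
ms_interaction = 68.26   # interaction mean square, adjusted for regression
ms_within = 57.38        # within-subclasses mean square, adjusted for regression
k = 1.014                # coefficient of the interaction component (value from the text)

# First estimates of the two variance components.
var_within = ms_within
var_interaction = (ms_interaction - ms_within) / k


def weight(n):
    """Formula (III-4) evaluated with the estimated components."""
    return n / (n * var_interaction + var_within)


w_one, w_two = weight(1), weight(2)   # subclasses with 1 and 2 observations

# Rounded to three decimals these are in the ratio 3 : 5 (the coded weights).
coded_ratio = round(w_two, 3) / round(w_one, 3)

print(round(var_interaction, 2), round(w_one, 3), round(w_two, 3), round(coded_ratio, 4))
```

Only the ratio of the weights affects the estimates of the effects, which is why the coded values 3 and 5 can stand in for .015 and .025.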

With the computed weights Table 11 can now be completed. Applying
the formulae given for a Case III analysis, Table 12 is constructed. The
various estimates used to obtain the sums of products are the $\mu'$, $\tau_i'$,
$\rho_j'$, $\mu_x'$, $\tau_{xi}'$, and $\rho_{xj}'$ values in Table 12.
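
As a check on the lower part of Table 12, the following sketch recomputes the interaction regression coefficient, the adjusted error sum of squares and the adjusted treatment means from the unadjusted estimates. The estimates are entered as read from Table 12. The covariate mean 114/15 (the overall mean of X in Table 9) is an assumption made here because it reproduces the tabled adjusted means; the source does not state it explicitly.

```python
# Unadjusted estimates as read from Table 12 (the primed quantities).
mu_y = 72.3350
tau_y = [13.7775, 6.7400, 1.3025, -9.0975, -12.7225]
mu_x = 7.6850
tau_x = [4.5525, 2.5150, -4.1725, -0.6975, -2.1975]

# Error (interaction) line of Table 12: y^2, xy, x^2.
e_yy, e_xy, e_xx = 1382.25, 393.19, 251.63

slope = e_xy / e_xx                 # interaction regression coefficient
regression_ss = e_xy ** 2 / e_xx    # one degree of freedom
error_adj = e_yy - regression_ss    # 3 d.f. remain for error

# ASSUMPTION: adjustment is to the overall covariate mean of Table 9.
x_bar = 114 / 15

# Adjusted mean for each treatment: unadjusted mean minus slope times the
# deviation of that treatment's covariate mean from x_bar.
adjusted_means = [mu_y + ty - slope * (mu_x + tx - x_bar)
                  for ty, tx in zip(tau_y, tau_x)]

print(round(slope, 6), round(regression_ss, 2), round(error_adj, 2))
print([round(m, 4) for m in adjusted_means])
```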

A second estimate of $\sigma^2_{\rho\tau}$ is obtained from Tables 6 and 12 as


767.86 
= To las + 2.6875 — 13.4375 — 5.3797 57.38} = 10.01, 


Wi2 = 


025. 


Pac 


where k, is computed in the same manner as described above and where 
27.5 + 2.6875 + 13.4375 — 5.3791 
wii (1-1-1) 
Ni; Ni; W;. 
Since the second estimate of $\sigma^2_{\rho\tau}$ is almost identical with the first
estimate of $\sigma^2_{\rho\tau}$, which is $\hat{\sigma}^2_{\rho\tau} = 10.73$, the second set of weights would be
almost identical with the first. Likewise, the new analysis of variance
table and the adjusted effect differences would be very similar. Therefore,
the iterative analysis of variance stops here and the adjusted
treatment means are obtained (Table 12). Under the assumption of
fixed weights the tests of significance suggested previously may be
carried out using the interaction mean square as the error mean square.
After obtaining $\hat{\beta}_1 = 393.19/251.63 = 1.562572$, the $\hat{\mu} + \hat{\tau}_i$ and $\hat{\rho}_j$
values may be computed. The two equations for the $\hat{\rho}_j$’s are (III-12):


p. = 1465.5 — 3(681.0 + 498.0 + 469.0) 


> 

| 

| 


— 3(640.5 + 597.0) 
— 1.562572[146.5 — 3(97.5 + 55.5 + 43.5) 
— 3(82.0 + 28.5)] 

74.0625 — 5.859645 = 68.202855; 

TABLE 12

COVARIANCE ANALYSIS FOR THE DATA IN TABLE 11.

                                                 Sum of products
Source of variation              D.f.       y^2          xy          x^2
Total (on means)                   9      6111.49     1273.29      648.78
Replicate (ign. treat.)            1       902.62        6.42         .05
Treatment (elim. rep.) = T         4      3826.62      873.68      397.10
Error = E                          4      1382.25      393.19      251.63
Within subclasses = S              5       238.50        6.00        4.00
Treatment (ign. rep.)              4      4144.15      850.48      395.65

                                   Regression          Adjusted sum of squares
Source of variation              D.f.     S.s.        D.f.      S.s.       M.s.
S                                  1        9.00        4       229.50      57.38
E                                  1      614.39        3       767.86     255.95
E + T                              1     2474.00        7      2734.87       -
Treatment (elim. other effects)                         4      1967.01     491.75

$\mu' = 72.3350$       $\tau_4' = -9.0975$      $\tau_{x1}' = 4.5525$     $\tau_{x5}' = -2.1975$
$\tau_1' = 13.7775$    $\tau_5' = -12.7225$     $\tau_{x2}' = 2.5150$     $\mu_x' = 7.6850$
$\tau_2' = 6.7400$     $\rho_1' = 3.9500$       $\tau_{x3}' = -4.1725$    $\rho_{x1}' = 0.2000$
$\tau_3' = 1.3025$     $\rho_2' = -3.9500$      $\tau_{x4}' = -0.6975$    $\rho_{x2}' = -0.2000$
$\hat{\beta} = 1.5000$                          $\hat{\mu} + \hat{\tau}_3 = 80.0245$
$\hat{\beta}_1 = 1.5626$                        $\hat{\mu} + \hat{\tau}_4 = 64.1946$
$\hat{\mu} + \hat{\tau}_1 = 78.8661$            $\hat{\mu} + \hat{\tau}_5 = 62.9134$
$\hat{\mu} + \hat{\tau}_2 = 75.0123$            $\hat{\rho}_1 = 3.6375 = -\hat{\rho}_2$


Alternatively, the two equations involving #, and #, from formula 
(III-9) are: 


(116.5 ~ 3(196.5) — 4110.5)" | 


(2 + 953.13 13 3-7 5)(—8.75) 


422.81 
= 5 _ = 6 a 8 72, 
74.0625 253.13 (3.75) 17.7987 
and
A = 

where 253.13 = 648.78 — 395.65 and 422.81 = 1273.29 — 850.48. 
SUMMARY 


Variance and covariance analyses have been classified under three 


categories, viz., Case I, interaction absent; Case II, interaction present 


and the effects assumed to be fixed effects; and Case III, interaction 
present and the interaction effects and at least one of the main effects 
of the factors represented in the interactions assumed to be random 
effects. The statistical procedures for the three cases have been derived 
for two-way and three-way classifications and are illustrated with numer- 
ical examples for the two-way classification with a covariate. The pro- 
cedures for a g-way classification with b covariates are indicated. 


REFERENCES 
[1] Bartlett, M. S. [1936] A note on the analysis of covariance, J. Agr. Sci. 26:
488-491.

[2] Bartlett, M. S. [1937] Some examples of statistical methods of research in
agriculture and applied biology, J. Roy. Stat. Soc. Suppl. 4: 137-183.

[3] Cochran, W. G. [1943] Analysis of variance for percentages based on unequal
numbers, J. Amer. Stat. Assoc. 38: 287-301.

[4] Das, M. N. [1953] Analysis of covariance in two-way classification with
disproportionate cell frequencies, J. Indian Soc. Agric. Stat. 5: 161-178.

[5] Day, B. and Fisher, R. A. [1937] The comparison of variability in populations
having unequal means. An example of the analysis of covariance with
multiple dependent and independent variates, Annals of Eugenics 7: 333-348.

[6] Federer, W. T. [1954] Covariance analysis in a two-way classification with
unequal numbers in the subclasses, BU-51-M (mimeograph), Cornell University,
Ithaca, New York, April.

[7] Federer, W. T. [1955] Experimental design: theory and application, Macmillan
Company, New York.

[8] Federer, W. T. Covariance analysis for unbalanced two-way classifications, Cornell
Univ. Agric. Expt. Sta. Memoir (accepted for publication).

[9] Hazel, L. N. [1946] The covariance analysis of multiple classification tables
with unequal subclass numbers, Biometrics 2: 21-25.

[10] Henderson, C. R. [1948] Estimation of general, specific, and maternal combining
abilities in crosses among inbred lines of swine, Ph.D. Thesis, Iowa State
College Library, Ames, Iowa.

[11] Henderson, C. R. [1953] Estimation of variance and covariance components,
Biometrics 9: 226-252.

[12] Nair, K. R. [1941] A note on the method of “fitting of constants” for analysis
of non-orthogonal data arranged in a double classification, Sankhyā 5: 317-328.

[13] Outhwaite, A. D., and Rutherford, A. [1955] Covariance analysis as an alternative
to stratification in the control of gradients, Biometrics 11: 431-440.




362 BIOMETRICS, SEPTEMBER 1957 


[14] Quenouille, M. H. [1948] The analysis of covariance and non-orthogonal
comparisons, Biometrics 4: 240-246.

[15] Rao, C. R. [1946] On the linear combination of observations and the general
theory of least squares, Sankhyā 7: 237-256.

[16] Snedecor, G. W. [1946] Statistical methods, 4th ed. Iowa State College Press,
Ames, Iowa.

[17] Snedecor, G. W. and Cox, G. M. [1935] Disproportionate subclass numbers in
tables of multiple classification, Iowa Agric. Expt. Sta. Res. Bul. 180: 233-272.

[18] Stevens, W. L. [1948] Statistical analysis of a non-orthogonal tri-factorial
experiment, Biometrika 35: 346-367.

[19] Wilks, S. S. [1938] Analysis of variance and covariance in non-orthogonal data,
Metron 13: 141-154.

[20] Yates, F. [1933] The principles of orthogonality and confounding in replicated
experiments, J. Agric. Sci. 23: 108-145.

[21] Yates, F. [1934] The analysis of multiple classifications with unequal numbers
in the different classes, J. Amer. Stat. Assoc. 29: 51-66.

[22] Yates, F. [1938] Orthogonal functions and tests of significance in the analysis
of variance, J. Roy. Stat. Soc. Suppl. 5: 177-180.




THE ANALYSIS OF COVARIANCE WITH INCOMPLETE DATA 


G. N. WILKINSON 


Commonwealth Scientific and Industrial Research Organization, Division of 
Mathematical Statistics, Adelaide, Australia 


1. Introduction 


In a paper [4] submitted for publication in this journal, the author 
has presented simplified methods for setting up and solving equations
for missing values. The methods apply to those designs and data for 
which normal theory, with linear model, provides the appropriate 
analysis, and have been extended to cover covariance analyses. 

A. T. James, in a personal communication, drew the author’s 
attention to the fact that concomitant measurements corresponding to 
missing observations are irrelevant to the analysis of existing observa- 
tions, and could therefore be replaced by hypothetical values more 
convenient for determining missing values and for the subsequent 
analysis of covariance on the completed data. Application of this 
suggestion has greatly simplified the author’s results, and as the pro- 
cedure is so simple, it has been thought worth while to outline it in 
this advance note. 

Since deriving these results, the author’s attention has been drawn
to a paper by Barnard [1] in which the method of fitting values appears 
to be the same as that proposed here. Barnard’s description is very 
brief, however, being incidental to the main content of the paper 
(analysis of a crop-weather scheme), and seems to have been overlooked 
in the literature and textbooks. The present paper gives details of the 
fitting process, and also of deriving standard errors and exact significance 
tests. 


2. Basic procedure 


Suppose that a covariance analysis is required on a set of observations
of a variate y and the corresponding measurements of p concomitant
variates $x_1, x_2, \cdots, x_p$. If some observations on y are missing,

(i) discard all measurements of $x_1, x_2, \cdots, x_p$ that correspond
to the missing observations on y,

(ii) fit a set of missing values for y, as though for an ordinary
analysis (ignoring covariance),

(iii) with exactly parallel calculations fit sets of missing values
for $x_1, x_2, \cdots, x_p$, to replace those discarded,

(iv) carry out the covariance analysis on the completed data
for y and $x_1, \cdots, x_p$.

These steps provide the simplest set of calculations for exact analysis,
and formal justification of the procedure is given in [4].


3. Estimation of the missing values 


Each set of missing values could, of course, be determined iteratively, 
but solution of missing value equations is much the shorter method. 
Each set of equations has the form 


Au=E 


where A is the matrix of coefficients, u is the vector of missing value 
unknowns, and E is a vector of initial plot estimates for those plots 
with missing observations. The initial plot estimates are determined 
by substituting zeros for the missing (or discarded values) in the data, 
that is, by using incomplete totals, etc., as though they were complete. 
The matrix A is determined by the configuration in the design of the 
plots with missing values, and is the same for all p + 1 sets of equations. 
Inversion of this one matrix therefore provides estimates of missing
values for all the variates $y, x_1, x_2, \cdots, x_p$. Rapid methods for forming
and inverting the matrix of missing value equations are given in [4].
Each element of the matrix corresponds to a pair of missing values 
(the diagonal elements corresponding to the identity pairs (u, u)), 
and is determined by their relationship in the design, for example, 
“Same Block” or “Identical”. The author has prepared for a number 
of the standard designs, tables which give the various possible relation- 
ships for each design, and the corresponding elements from which the 
matrix may be constructed. One such table (Table 4) is given. 


4. Numerical example 


So that comparison may be made with the parallel analysis on a 
complete set of observations, the data used by Quenouille [3] (pp. 
40-47) have been rendered incomplete by removing three observations 
at random. 

The data are the results of an insulin-response experiment, with 
four dosage levels (treatments) A, B, C and D, carried out on eight 
rabbits on each of four occasions (phases), according to the Latin 
square design shown in Table 1. The percentage falls in blood sugar 
(y) are given in Table 2, and the initial blood sugars (x) in Table 3.
The three measurements of x corresponding to the missing observations 
on y are omitted in Table 3, in accordance with 2(i) above. Incomplete 
totals are shown in separate columns, and completed totals, obtained 
after estimating missing values, are indicated by heavy type. 
Calculations for determining the missing values u, v and w are 

TABLE 1
DESIGN OF INSULIN-RESPONSE EXPERIMENT
Rabbit
Phase 1 2 3 4 5 6 7 8
1 CAB DCA Bs 
2 BD¢-A DB aA 
3 AC DBA 
4 
TABLE 2
PERCENTAGE FALL IN BLOOD SUGAR (y)

                                  Rabbit                                       Totals
Phase         1      2      3      4      5      6      7      8     Incomplete  Complete
1           32.7   11.2   23.2   48.1   35.1   27.2   36.0   40.0                  253.5
2           26.2   31.8   28.9   18.7   37.2   w(B)   37.8   25.8      206.4       242.3
3           -4.0   14.0   27.5   v(B)    2.8   36.2   28.4   50.7      155.6       179.7
4           u(D)   16.5   21.2   40.2   12.7   47.7   25.1   39.4      202.8       236.7
Incomplete
total       54.9                 107.0         111.1                   818.3
Complete
total       88.8   73.5  100.8  131.1   87.8  147.0  127.3  155.9                  912.2

Dosage                A        B        C        D
Incomplete total             147.0             279.0
Complete total      128.0    207.0    264.3    312.9


shown in Table 5. For computational convenience the equations have
been multiplied through by a factor of 16. The matrix A may be
determined from first principles, or more rapidly from Table 4: the
diagonal elements in 16A are 9; the missing values (u, v), (u, w) are in
different rows and columns, and have different treatments, so that the
corresponding elements in the matrix are +1; the pair (v, w), in different
rows and columns, have the same treatments, hence the corresponding
element is -1.

For a 3 × 3 matrix the simplest inversion procedure is to form the

TABLE 3


INITIAL BLOOD SUGAR (x)

                                  Rabbit                                       Totals
Phase         1      2      3      4      5      6      7      8     Incomplete  Complete
1            107     91     99     93    103     88     94     89                  764
2             94     93     90     77     94   w(B)     91     89       628        707.9
3             75     83     91   v(B)     87     79     81     84       580        653.9
4           u(D)     86     97     87    101     81     87     95       634        730.9
Incomplete
total        276                  257           248                    2606
Complete
total      372.9    353    377  330.9    385  327.9    353    357                 2856.7

Dosage                A        B        C        D
Incomplete total              550               630
Complete total       691    703.8      735    726.9

TABLE 4


ELEMENTS OF A-MATRIX FOR MISSING VALUE EQUATIONS.
ROW-COUPLED LATIN SQUARES


                              r  p × p squares       2  4 × 4 squares
(u, v) relation               $a_{uv} \times rp^2$   $a_{uv} \times 16$

Identical                     (rp - 2)(p - 1)              9
Same C                        -(rp - 2)                   -3
Same R, same T                -2(p - 1)                   -3
Same R, diff. T               -(p - 2)                    -1
Diff. R and C, same T         -(p - 2)                    -1
Diff. R and C, diff. T         2                           1


matrix of cofactors $[A_{ij}]$, and divide by the determinant $|A|$, which
may be calculated by taking the sum of products of any two corresponding
rows in $A$ and $[A_{ij}]$.

The formula for a plot estimate in this design is, in the usual notation,


$$E_{ijk} = (2R_i + 4C_j + 2T_k - G)/16.$$

TABLE 5


ESTIMATION OF THE MISSING VALUES


          16A            $[A_{ij}] \times (16)^2$     Initial plot estimates     Missing values
      u    v    w                                     16E(y)      16E(x)         u(y)     u(x)
u     9    1    1          80   -10   -10             364.9        1026          33.9     96.9
v     1    9   -1         -10    80    10             214.9         682          24.1     73.9
w     1   -1    9         -10    10    80             332.9         742          35.9     79.9


$|A| \times (16)^3 = 700$


TABLE 6
ANALYSIS OF VARIANCE AND COVARIANCE

Source of variation      D.f.     S.s. (y)      S.p.       S.s. (x)
Rabbits                    7      1612.47     -674.02       749.19
Phases                     3       407.92      529.70       804.33
Dosages                    3      2370.84      546.46       155.04
Residual                  15       519.79      106.37       297.30
Total                     28      4911.02      508.51      2005.86

ANALYSIS OF RESIDUAL VARIANCE

Source of variation               D.f.     S.s. (y)      M.s.
Regression on x                     1        38.06       38.06
Residual (elim. regression)        14       481.73       34.41
Residual (ignoring regression)     15       519.79       34.65


Treatment means                        A          B          C          D        Average
Initial blood sugar (x)             86.3750    87.9750    91.8750    90.8625     89.2719
  Deviations                        -2.8969    -1.2969     2.6031     1.5906      0
Fall in blood sugar (y)
  Unadjusted                        16.0000    25.8750    33.0375    39.1125     28.5063
  Adjusted to $\bar{x}$ = 89.2719    17.04      26.34      32.11      38.54
  (b = 0.35781)

Using the incomplete totals of Tables 2 and 3, two sets of three initial
plot estimates E(y) and E(x) (multiplied by 16) are calculated. For
example,


$$16E_v(y) = 2 \times 155.6 + 4 \times 107.0 + 2 \times 147.0 - 818.3 = 214.9.$$

The missing values are now determined by taking the sums of products
$A^{-1}E$. For example, in Table 5,

$$w(x) = (-1 \times 1026 + 1 \times 682 + 8 \times 742)/70 = 79.9.$$
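
The whole fitting step for this example can be retraced with the sketch below, which forms the initial plot estimates from the incomplete totals, inverts 16A by cofactors and solves for u, v and w. The function names are illustrative assumptions; the incomplete totals are those read from Tables 2 and 3 and the matrix 16A is that of Table 5.

```python
def cofactor_matrix_3x3(m):
    """Matrix of cofactors of a 3 x 3 matrix given as nested lists."""
    def minor(i, j):
        rows = [r for r in range(3) if r != i]
        cols = [c for c in range(3) if c != j]
        return (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
                - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
    return [[(-1) ** (i + j) * minor(i, j) for j in range(3)] for i in range(3)]


def solve_missing(a16, e16):
    """Solve (16A)u = 16E by cofactors, as suggested for a 3 x 3 matrix."""
    cof = cofactor_matrix_3x3(a16)
    # Determinant: sum of products of a row of the matrix and the same row of cofactors.
    det = sum(a16[0][j] * cof[0][j] for j in range(3))
    # The inverse is the transposed cofactor matrix divided by the determinant.
    return [sum(cof[j][i] * e16[j] for j in range(3)) / det for i in range(3)]


def plot_estimate_16(row_total, col_total, treat_total, grand_total):
    """16 x initial plot estimate, 2R + 4C + 2T - G, from incomplete totals."""
    return 2 * row_total + 4 * col_total + 2 * treat_total - grand_total


a16 = [[9, 1, 1],
       [1, 9, -1],
       [1, -1, 9]]   # rows and columns in the order u, v, w

# (phase, rabbit, dosage) incomplete totals for u, v, w; G = 818.3 for y, 2606 for x.
e16_y = [plot_estimate_16(202.8, 54.9, 279.0, 818.3),
         plot_estimate_16(155.6, 107.0, 147.0, 818.3),
         plot_estimate_16(206.4, 111.1, 147.0, 818.3)]
e16_x = [plot_estimate_16(634, 276, 630, 2606),
         plot_estimate_16(580, 257, 550, 2606),
         plot_estimate_16(628, 248, 550, 2606)]

missing_y = solve_missing(a16, e16_y)
missing_x = solve_missing(a16, e16_x)

print([round(v, 1) for v in missing_y], [round(v, 1) for v in missing_x])
```

Because the same matrix serves both variates, only the right-hand sides change between the two calls to `solve_missing`.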


The analysis of variance and covariance on the completed data is 
shown in Table 6, with the analysis of residual variance and a table of 
unadjusted and adjusted treatment means. 
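
The residual regression and the adjusted means of Table 6 follow from a few lines of arithmetic, sketched here in Python. The figures are those of Table 6; the variable names are illustrative, and the slope computed from the rounded residual sums differs from the tabled b = 0.35781 in the fifth decimal place.

```python
# Residual line of Table 6.
residual_ssy = 519.79
residual_sp = 106.37
residual_ssx = 297.30

b = residual_sp / residual_ssx                  # regression coefficient
regression_ss = residual_sp ** 2 / residual_ssx
residual_adj_ms = (residual_ssy - regression_ss) / 14

# Treatment means of the completed data.
x_means = {"A": 86.3750, "B": 87.9750, "C": 91.8750, "D": 90.8625}
y_means = {"A": 16.0000, "B": 25.8750, "C": 33.0375, "D": 39.1125}
x_overall = sum(x_means.values()) / len(x_means)

# Each mean is adjusted to the overall mean of the concomitant variate.
adjusted = {t: y_means[t] - b * (x_means[t] - x_overall) for t in x_means}

print(round(b, 5), round(regression_ss, 2), round(residual_adj_ms, 2))
print({t: round(v, 2) for t, v in adjusted.items()})
```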


5. Variance-covariance matrix of adjusted treatment means 


If any treatment comparison involves fitted missing values, its 
usual variance must be increased somewhat. The adjustment to the 
variance is easily computed from the inverse of the matrix in the missing 
value equations. 

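Suppose the treatment comparison $L$ contains $l_1 u + l_2 v + l_3 w$, $= l'u$
say. The increase in variance of $L$ due to fitted missing values is

$$l'A^{-1}l\,\sigma^2.$$

Similarly, the adjustment to the covariance of two treatment comparisons
$L$ and $M$ is

$$l'A^{-1}m\,\sigma^2.$$

Hence in our example (see Table 7) the variance-covariance matrix
of unadjusted treatment means is easily obtained. $\bar{y}_A$ and $\bar{y}_C$ do not
involve missing values, so their variance is $\sigma^2/8$. $\bar{y}_B$ contains $(v + w)/8$.
The increase in variance is therefore

$$\frac{1}{8^2}\cdot\frac{16}{70}\begin{bmatrix}0 & 1 & 1\end{bmatrix}\begin{bmatrix}8 & -1 & -1\\ -1 & 8 & 1\\ -1 & 1 & 8\end{bmatrix}\begin{bmatrix}0\\ 1\\ 1\end{bmatrix}\sigma^2 = \frac{9}{140}\sigma^2 = 0.0643\sigma^2.$$

Similarly, the increase in variance of $\bar{y}_D$ is $\sigma^2/35 = 0.0286\sigma^2$. The only
covariance increase is for $\bar{y}_B$ and $\bar{y}_D$, and is

$$\frac{1}{8^2}\cdot\frac{16}{70}\begin{bmatrix}0 & 1 & 1\end{bmatrix}\begin{bmatrix}8 & -1 & -1\\ -1 & 8 & 1\\ -1 & 1 & 8\end{bmatrix}\begin{bmatrix}1\\ 0\\ 0\end{bmatrix}\sigma^2 = -\frac{1}{140}\sigma^2 = -0.0071\sigma^2.$$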

TABLE 7


VARIANCE-COVARIANCE MATRIX OF TREATMENT MEANS ADJUSTED TO THE
MEAN BLOOD SUGAR, 89.2719 (ENTRIES TO BE MULTIPLIED BY $\sigma^2$)


Unadjusted
          A        B        C        D
A       0.125    0        0        0
B       0        0.125    0        0
C       0        0        0.125    0
D       0        0        0        0.125

+ Adjustment for missing values
A       0        0        0        0
B       0        0.064    0       -0.007
C       0        0        0        0
D       0       -0.007    0        0.029

+ Adjustment for covariant adjustments
A       0.028    0.013   -0.025   -0.016
B       0.013    0.006   -0.011   -0.007
C      -0.025   -0.011    0.023    0.014
D      -0.016   -0.007    0.014    0.009

= Variance-covariance matrix of adjusted means
A       0.153    0.013   -0.025   -0.016
B       0.013    0.195   -0.011   -0.014
C      -0.025   -0.011    0.148    0.014
D      -0.016   -0.014    0.014    0.162


If the treatment means $\bar{y}$ are adjusted to a given value $x_0$ of a concomitant
variate, that is, to $\bar{y} - b(\bar{x} - x_0)$, the corresponding adjustment
to the variance, when there are no missing values, is


$$(\bar{x} - x_0)^2\,\mathrm{Var}(b) = \frac{(\bar{x} - x_0)^2\sigma^2}{\text{Residual S.s. for } x}.$$


Likewise, the adjustment to the covariance of two means is


$$\frac{(\bar{x}_1 - x_0)(\bar{x}_2 - x_0)\sigma^2}{\text{Residual S.s. for } x}.$$


The generalization for covariant adjustment on one or more of several
variates is straightforward. These adjustments to variance and covariance
also apply, without modification, when missing values are
fitted for some observations y, provided that corresponding x-values
have been fitted as indicated above, for the variance of b is then correctly
determined by the residual sum of squares for x in the analysis
of covariance for the completed data.

The variance-covariance matrix of adjusted treatment means is
therefore the sum of three components:

(i) the variance-covariance matrix of unadjusted means (ignoring
missing values),

(ii) the adjustment to (i) for missing values,

(iii) the adjustment to (i) for the covariant adjustments.

These components are set out in Table 7. It will be seen that treatment
B, for which there are two missing values, suffers an appreciable increase
in variance.
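
The three components can be assembled numerically as in the following sketch, which rebuilds the matrix of Table 7 in units of $\sigma^2$. The inverse of A is taken from the cofactors and determinant of Table 5 and the deviations of the x means from Table 6; the names used are illustrative assumptions, and entries may differ from the tabled ones by a unit in the third decimal because of rounding in the inputs.

```python
treatments = ["A", "B", "C", "D"]
plots_per_mean = 8

# Inverse of A = (16/700) x cofactor matrix of 16A (order u, v, w), from Table 5.
cofactors = [[80, -10, -10], [-10, 80, 10], [-10, 10, 80]]
a_inv = [[16 * c / 700 for c in row] for row in cofactors]

# Coefficients of u, v, w in each treatment mean: u is a D plot, v and w are B plots.
coef = {"A": [0, 0, 0], "B": [0, 1 / 8, 1 / 8], "C": [0, 0, 0], "D": [1 / 8, 0, 0]}

# Deviations of treatment means of x from the overall mean, and residual S.s. for x.
x_dev = {"A": -2.8969, "B": -1.2969, "C": 2.6031, "D": 1.5906}
residual_ssx = 297.30


def bilinear(l, m):
    """l' A^{-1} m, the adjustment for fitted missing values."""
    return sum(l[i] * a_inv[i][j] * m[j] for i in range(3) for j in range(3))


def entry(s, t):
    unadjusted = 1 / plots_per_mean if s == t else 0.0    # component (i)
    missing = bilinear(coef[s], coef[t])                  # component (ii)
    covariate = x_dev[s] * x_dev[t] / residual_ssx        # component (iii)
    return unadjusted + missing + covariate


matrix = [[entry(s, t) for t in treatments] for s in treatments]

for name, row in zip(treatments, matrix):
    print(name, [round(v, 3) for v in row])
```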


6. Significance tests 


Yates [6] has shown that the usual analysis of variance on data with
fitted missing values yields a slightly exaggerated treatment sum of
squares. In principle, to obtain an exact test of significance, an auxiliary
set of missing values should be fitted for an auxiliary analysis of variance
which ignores treatments. The correct treatment sum of squares is
then the difference between the residual sum of squares (ignoring
treatments) and the residual sum of squares (eliminating treatments).

In practice it is not necessary to fit another set of missing values.
Yates [6] and Cornish [2] have provided formulae for correcting the
treatment sum of squares in randomized block and incomplete block
experiments, and the author [5] has shown that the general form of such
corrections is

$$\eta' A_a^{-1} \eta,$$

where $A_a$ is the matrix of the missing value equations for auxiliary
estimates, and $\eta$ is the vector of residuals (ignoring treatments) of the
fitted values (main estimates). In an orthogonal design, the residual
(ignoring treatment) of a fitted value is just the estimated treatment
effect for that plot.

The extension of this argument to analysis of covariance is fairly
obvious. Residual vectors (ignoring treatments) of fitted missing
values are obtained for all variates. Denote these vectors by
$\eta, \xi_1, \xi_2, \cdots, \xi_p$. The matrix of auxiliary missing value equations is
the same for all variates. The “treatments” line of the analysis of
variance and covariance is then corrected by subtracting the appropriate
bilinear forms in $\eta, \xi_1, \xi_2, \cdots, \xi_p$. Thus the treatment sum of squares
for y is reduced by $\eta' A_a^{-1}\eta$, the sum of products for $x_1$ and y by $\eta' A_a^{-1}\xi_1$,
and the sum of squares for $x_1$ by $\xi_1' A_a^{-1}\xi_1$, etc. The significance test
for treatments (eliminating covariance) is then constructed in the usual
way from the analysis of variance and covariance.

The experimental design ignoring treatments, for the data given in 
section 4, reduces to a two-way classification, phases × rabbits. As all 
missing values are in different rows and columns, the matrix for the 
auxiliary missing value equations may be shown to be 

$$A = \frac{1}{32}\begin{pmatrix} 21 & 1 & 1 \\ 1 & 21 & 1 \\ 1 & 1 & 21 \end{pmatrix},$$

the inverse of which is 

$$A^{-1} = \frac{32}{20}\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} - \frac{32}{460}\begin{pmatrix} 1 & 1 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \end{pmatrix}.$$

This gives the correction formulae $\eta' A^{-1} \eta = (32/20) \sum \eta^2 - (32/460) (\sum \eta)^2$, 
etc. The formula for the residual (ignoring treatments) of a fitted 
value $u$ is, in the usual notation, 

$$u - \frac{4P + 8R - G}{32},$$

where $P$, $R$ and $G$ are the phase, rabbit and grand totals, the relevant 
totals being those for the completed data. It is more convenient for 
computation to calculate the quantities $(32\eta)$. These are given in 
Table 8. The correction formulae are then 

$$\eta' A^{-1} \eta = \sum (32\eta)^2/640 - (\sum 32\eta)^2/14720, \text{ etc.}$$


TABLE 8 
TREATMENT EFFECTS (UNADJUSTED) FOR u, v AND w (× 32) 

Treatment     u (D)     v (B)     w (B)     Total 
(32η)         339.4     -84.2     -84.2     171.0 
(32ξ)          50.9     -41.5     -41.5     -32.1 
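
The arithmetic of the correction can be checked directly. The following is a minimal sketch, assuming the matrix of auxiliary missing value equations is (1/32)(20 I + J) as read from the text above, and using the Table 8 entries together with the uncorrected "treatments" line of Table 9; the function and variable names are illustrative only.

```python
# Check that (32/20) I - (32/460) J is the inverse of A = (1/32)(20 I + J).
A = [[21 / 32 if i == j else 1 / 32 for j in range(3)] for i in range(3)]
A_inv = [[32 / 20 * (i == j) - 32 / 460 for j in range(3)] for i in range(3)]
product = [[sum(A[i][k] * A_inv[k][j] for k in range(3)) for j in range(3)]
           for i in range(3)]

# Table 8 entries: 32 * eta (for y) and 32 * xi (for x) at the three fitted plots.
eta32 = [339.4, -84.2, -84.2]
xi32 = [50.9, -41.5, -41.5]


def bilinear_correction(a32, b32):
    """Return a' A^{-1} b when a and b are supplied as 32 times the residuals."""
    sum_of_products = sum(p * q for p, q in zip(a32, b32))
    return sum_of_products / 640.0 - sum(a32) * sum(b32) / 14720.0


# Uncorrected "treatments" line: S.s. (y), S.p., S.s. (x).
uncorrected = {"yy": 2370.84, "xy": 546.46, "xx": 155.04}
corrections = {
    "yy": bilinear_correction(eta32, eta32),
    "xy": bilinear_correction(eta32, xi32),
    "xx": bilinear_correction(xi32, xi32),
}
corrected = {key: uncorrected[key] - corrections[key] for key in uncorrected}

for key in ("yy", "xy", "xx"):
    print(key, round(corrections[key], 2), round(corrected[key], 2))
```

Up to rounding in the last figure, the corrected values agree with the "Treatments (corrected)" line of Table 9.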


The corrected analysis of covariance and the final analysis for treatment 
effects (eliminating covariance) are given in Table 9. 

It should be pointed out here that the analyses in this paper have 
been performed for purposes of illustration only. It is apparent from 
the analysis of residual variance in Table 6, that in this instance no 
advantage is gained by adjusting for the concomitant variate in the 
analysis. 
TABLE 9 
CORRECTION OF TREATMENT SUMS OF SQUARES AND SUM OF PRODUCTS 

Source of variation                  D.f.   S.s. (y)    S.p.      S.s. (x) 
Treatments (uncorrected)               3    2370.84    546.46     155.04 
  - Σ(32η)²/640, etc.                       -202.14    -37.91      -9.43 
  + (Σ32η)²/14720, etc.                        1.99     -0.37       0.07 
Treatments (corrected)                 3    2170.69    508.18     145.68 
Residual (eliminating treatments)     15     519.79    106.37     297.30 
Residual (ignoring treatments)        18    2690.48    614.55     442.98 


ANALYSIS FOR TREATMENTS (ELIMINATING REGRESSION ON x) 

Source of variation                  D.f.   S.s.       M.s.       F 
Treatments (uncorrected)               3    1466.72    488.91     14.21 
Treatments (exact)                     3    1356.18    452.06     13.14 
Residual (eliminating treatments)     14     481.73     34.41 
Residual (ignoring treatments) 
  uncorrected                         17    1948.45 
  exact                               17    1837.91 
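
The second part of Table 9 follows from the first by the usual covariance arithmetic. The sketch below is an illustration under the assumption that a single covariate x is fitted, so that each residual sum of squares is S.s. (y) minus S.p. squared over S.s. (x); the helper names are hypothetical.

```python
def residual_after_regression(ss_y, sp_xy, ss_x):
    """Sum of squares of y about its linear regression on x."""
    return ss_y - sp_xy ** 2 / ss_x


def adjusted_treatment_test(treatments, error, treatment_df=3, error_df=14):
    """Treatments (eliminating regression) from the treatments and error lines.

    Each line is a tuple (S.s. (y), S.p., S.s. (x)); error_df is the residual
    degrees of freedom after one has been given up to the regression.
    """
    total = tuple(a + b for a, b in zip(treatments, error))
    residual_error = residual_after_regression(*error)
    residual_total = residual_after_regression(*total)
    ss_treatments = residual_total - residual_error
    f_ratio = (ss_treatments / treatment_df) / (residual_error / error_df)
    return residual_error, residual_total, ss_treatments, f_ratio


error_line = (519.79, 106.37, 297.30)
exact = adjusted_treatment_test((2170.69, 508.18, 145.68), error_line)
uncorrected = adjusted_treatment_test((2370.84, 546.46, 155.04), error_line)

print([round(value, 2) for value in exact])
print([round(value, 2) for value in uncorrected])
```

The exact test gives an F ratio a little smaller than the uncorrected one, in line with the exaggeration of the treatment sum of squares noted at the start of this section.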
STRATIFICATION, BALANCE, AND COVARIANCE 


D. J. FINNEY 


Department of Statistics and A.R.C. Unit of Statistics, 
University of Aberdeen, Aberdeen, Scotland 


1. INTRODUCTION 


Some experimenters seek to avoid the need for randomized block 
designs by balancing the experimental units assigned to different 
treatments in respect of an initial quality. A particularly common 
instance is that of an animal nutrition experiment in which the initial 
mean weights of animals on the different diets are deliberately equalized 
by suitable allocation of animals to treatments. Apart from difficulties 
of maintaining an objective allocation, this procedure may achieve 
almost the same end as does a randomized block design, in increasing 
the true precision of treatment comparisons by eliminating variation 
associated with the factor used in balancing. What is often forgotten 
is that, if the statistical analysis appropriate to a completely randomized 
design is used, the experimental error will be overestimated; the more 
successful the balancing is in increasing true precision, the more seriously 
will the estimated precision so calculated fall below the truth. 

The explanation, and the manner in which the difficulty may be 
overcome, can be seen from comparison of four alternative experimental 
designs. Discussion of them will throw light on the practical utility 
of the balancing procedure. 


2. THE FOUR DESIGNS 


Suppose that t treatments are to be compared by an experiment on 
N animals, r being assigned to each treatment (N = rt). Results are to 
be assessed in terms of a final measurement, y, on each animal. The 
only information on the animals available at the start of the experiment 
is an initial (pre-treatment) measurement x, which is thought likely 
to show an approximately linear correlation (positive or negative) 
with y. Often x and y are measurements of the same kind, such as 
initial and final body weights, but this is not essential to the statistical 
argument. 

Four different experimental designs are in common use: 


I Completely randomized: Allot r animals to each treatment entirely 
at random. 


II Randomized block: Stratify the available animals in blocks of t 
in such a way that each block is reasonably 
homogeneous in respect of x. In each of r 
such blocks, select one animal at random 
for each treatment. 


III Simply balanced for treatments: Proceed similarly to I, but modify 
the results of the randomization so as to 
make the mean values of x for the several 
treatments approximately equal. 


IV Randomized block, balanced for treatments: Proceed similarly to II, 
but restrict random selection within the 
blocks so as to make the mean values of x 
for the several treatments approximately 
equal. 
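
The two fully randomized designs, I and II, are easily expressed as allocation procedures. The following is a minimal sketch, assuming a fixed set of N = rt animals identified by their positions in a list of initial measurements x, with blocks formed by ranking on x; the function names and the use of integer treatment labels 0, ..., t - 1 are illustrative assumptions.

```python
import random


def completely_randomized(n_animals, t, rng):
    """Design I: r = n_animals / t animals to each treatment, entirely at random."""
    r = n_animals // t
    labels = [treatment for treatment in range(t) for _ in range(r)]
    rng.shuffle(labels)
    return labels  # labels[k] is the treatment given to animal k


def randomized_blocks(x, t, rng):
    """Design II: rank the animals on x, cut into blocks of t, randomize within blocks."""
    order = sorted(range(len(x)), key=lambda k: x[k], reverse=True)
    allocation = {}
    for start in range(0, len(order), t):
        treatments = list(range(t))
        rng.shuffle(treatments)
        for animal, treatment in zip(order[start:start + t], treatments):
            allocation[animal] = (start // t, treatment)  # (block, treatment)
    return allocation


rng = random.Random(1957)
x = [31.2, 28.4, 35.0, 29.9, 33.1, 27.5, 30.8, 34.2]
print(completely_randomized(len(x), 4, rng))
print(randomized_blocks(x, 4, rng))
```

Designs III and IV have no equally simple counterpart, since the modification or restriction of the randomization is not defined by an explicit rule.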


The process of allocating animals to their places in the experiment 
will vary a little according to whether there is a large, effectively infinite, 
stock from which to select or a fixed set of N animals all of which must 
be used. For design I, the difference is slight. For II, an infinite 
population would first be regarded as stratified into an infinite number 
of blocks of t, and a random selection of r blocks would be made for 
the experiment; on the other hand, if a particular N animals had to be 
used, they would presumably be stratified into the t largest values of x, 
the t next largest, and so on, and all blocks would be taken. If the 
population of available animals is infinite, one can conceive of a selection 
for III or IV being made at random from an infinite number of sets of 
N animals allocated so as to give exact equality of treatment means for x, 
though the mechanics of making such a selection are not easily visualized! 
If only N animals are available, there may be no allocation satisfying 
the condition exactly and, even if some arbitrary definition of approxi- 
mate equality of means of x is adopted, there is not likely to be much 
scope for random selection amongst alternative allocations. Thus one 
immediate weakness in designs III and IV is the danger of bias or other 
invalidity entering into the statistical analyses through the method of 
allocation of animals to treatments. An objective rule of allocation 
that would at least come near to achieving balance could easily be 
based upon a preliminary arrangement of animals in descending order 
of x followed by assignment of the first t to treatments in one sequence, 
assignment of the next t in the reverse of this sequence, the next t in 
the original sequence, and so on. This, however, allows no scope for 
randomization except in the basic sequence of treatments. It is essen- 
tially the same as a systematic design for field-plot experiments that 
has been severely criticized by many statisticians over the last thirty 
years, and that is now generally discredited on account of the 
unpredictable bias in estimation of experimental error that it may produce. 
To introduce a satisfactory randomness of allocation appears to be 
difficult, but this will be overlooked for the present; succeeding sections 
of the paper will show that, even if difficulties under this head can be 
removed, balancing has little to recommend it. 
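
The objective rule of allocation described above (descending order of x, with the treatment sequence reversed in alternate groups of t) can be sketched as follows. This is an illustration only; the function name and the integer treatment labels are assumptions, and the only random element would be the choice of the basic sequence.

```python
def serpentine_allocation(x, t, base_sequence):
    """Assign treatments in alternating forward and reverse order down the ranking on x."""
    order = sorted(range(len(x)), key=lambda k: x[k], reverse=True)
    allocation = {}
    for group, start in enumerate(range(0, len(order), t)):
        sequence = base_sequence if group % 2 == 0 else base_sequence[::-1]
        for animal, treatment in zip(order[start:start + t], sequence):
            allocation[animal] = treatment
    return allocation


x = [10, 9, 8, 7, 6, 5, 4, 3]
allocation = serpentine_allocation(x, 2, [0, 1])
totals = [sum(x[k] for k in allocation if allocation[k] == j) for j in (0, 1)]
print(allocation, totals)
```

With these evenly spaced illustrative values the two treatment totals of x coincide; in general the rule gives only approximate balance.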

It is perhaps well to make clear at this point that the whole paper 
is written with reference to experiments in which the experimental 
unit or "plot" is a single animal, and measurements on different animals 
treated alike can be regarded as independent. Statisticians have 
frequently emphasized that, when "lots" of animals are penned together 
for uniform treatment (as is commonly done for chicks), the lot is to be 
regarded as the unit for analysis, and any calculation of an error variance 
from variation within lots is likely seriously to underestimate the truth. 
If lots were balanced so that all were nearly equal in mean or total 
value of an initial variate, x (whatever the variation of x within lots), 
the quantities $x_{ij}$, $y_{ij}$ defined in Section 3 should be taken to refer to 
totals or means of lots instead of to single animals. There would remain 
the possibilities of random allocation of these equalized lots to treat- 
ments (designs I, II) or a further balancing of any remaining differences 
between lots in respect of x, by allocation to treatments so that totals 
of x for all treatments are equal (designs III, IV again). The incentive 
to choose designs with this second balance over treatments ought to be 
much less when a first equalization of lots has been secured; if one is 
chosen, the objections that are to be raised below in respect of experi- 
ments on single animals will still apply. 

A preliminary balancing of lots that is to be followed by a randomiza- 
tion according to design I or II falls quite outside the scope of this 
paper, and does not affect the formal validity of the statistical analysis 
for randomized experiments. However, another entirely different 
danger, perhaps manifesting itself as a breakdown of the model described 
in Section 3, needs to be considered when this type of balancing is 
adopted. An immediate consequence of the balancing is likely to be 
that some lots are much more varied than others in respect of the values 
of x possessed by their members, and effects of competition within lots 
may thereby be intensified. For example, if x represents initial weight 
in a feeding trial, the more variable lots might have quite different 
expectations of mean weight increase from those that were uniform, 
even the variance of a final measurement might be affected, and, in 
extreme instances, further trouble might arise from differential mor- 
tality. Of course, none of these disturbances will necessarily occur, 
but their possibility needs to be taken into account before such balance 
is introduced. 
3. MODELS 


Kempthorne and Wilk [1955] have described a general type of 
linear model for experimental observations that includes as one extreme 
the infinite population of available experimental units and as the other 
extreme an experiment limited to a particular set of units. This might 
be adapted to the present problem. However, since the chief interest 
lies not in comparing I with II but in comparing both of these with 
III and IV, and there is the great difficulty of defining III and IV 
exactly without invoking an infinite population, the discussion that 
follows will be presented in terms of an infinite model. The difference 
is not important in a qualitative sense; for I and II at least, the finite 
model can be reconciled with the infinite by suitable definition of symbols. 

It will suffice to describe here the model and to define the symbols 
appropriate to II, leaving any modifications for other designs to be 
introduced as they are required. Provision must be made for studying 
the effects of analysing the experiment both without and with a co- 
variance on the initial measurement. Although, for any one experiment 
in which the analysis of covariance has been applied, the precision of 
comparisons in respect of y would ordinarily be assessed subject to the 
constancy of certain functions of x, for the purpose of comparing the 
average precisions of different designs it is necessary to consider x as a 
random variate in the population from which animals are selected. 

Write $x_{ij}$, $y_{ij}$ for the initial and final measurements on the animal 
in block $i$ that receives treatment $j$. Then the linear model for x can 
be written 

$$x_{ij} = \xi + \theta_i + \alpha_{ij}, \qquad (1)$$

where $\xi$ is the general mean for the population, $\theta_i$ and $\alpha_{ij}$ are random 
variates all independent of one another and 

$$E(\theta_i) = 0, \quad E(\theta_i^2) = \Sigma_x^2, \qquad (2)$$
$$E(\alpha_{ij}) = 0, \quad E(\alpha_{ij}^2) = \sigma_x^2. \qquad (3)$$

The model for y must make provision for a linear regression on the 
deviation of the corresponding x from its block mean. Hence 

$$y_{ij} = \eta + \phi_i + \tau_j + \beta\alpha_{ij} + \epsilon_{ij}, \qquad (4)$$

where $\eta$ is the general mean, $\tau_j$ is the deviation from the general mean 
attributable to treatment $j$, $\beta$ is the regression coefficient (here for 
simplicity assumed independent of treatment). The $\phi_i$ and $\epsilon_{ij}$ are 
random variates. The $\phi_i$ need not be independent of the $\theta_i$, and in 
general there will be a regression for these block variates also, which will 
be assumed linear. Writing 

$$\phi_i = B\theta_i + \psi_i, \qquad (5)$$

the model becomes 

$$y_{ij} = \eta + \tau_j + \psi_i + B\theta_i + \beta\alpha_{ij} + \epsilon_{ij}. \qquad (6)$$

In practice, $B$ may be expected to be equal to $\beta$, both being the regression 
coefficient of y on x in the whole population which has been arbitrarily 
stratified on the basis of values of x. The more general model is retained 
because of the possibility that it may sometimes be appropriate. The 
random variates $\theta_i$, $\psi_i$, $\alpha_{ij}$, $\epsilon_{ij}$ are all independent of one another, and 

$$E(\psi_i) = 0, \quad E(\psi_i^2) = \Sigma_y^2, \qquad (7)$$
$$E(\epsilon_{ij}) = 0, \quad E(\epsilon_{ij}^2) = \sigma_y^2. \qquad (8)$$

Note that $\Sigma_y^2$, $\sigma_y^2$ are inter-block and intra-block variances for y after 
elimination of the appropriate regressions on x. 

It is further convenient to write 

$$T^2 = \sum_j \tau_j^2/(t - 1), \qquad (9)$$

though $T^2$ is not necessarily, or even commonly, a variance of a random 
variate. The symbols $\bar{y}_j$ and $\bar{y}_j'$ will be used for the mean values of y 
for treatment $j$ without and with an adjustment for covariance on x 
calculated by the standard routine. 
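
To make the structure of the model concrete, the sketch below draws one set of observations for design II under the infinite model as read here: a block variate and an intra-block variate for x, with y depending on them through the coefficients B and β. All names and default values are illustrative assumptions. Normal variates are used throughout for convenience, although the argument requires normality only for the x components.

```python
import random


def simulate_design_ii(r, t, tau, rng, xi=50.0, eta=100.0, B=1.0, beta=1.0,
                       Sigma_x=5.0, sigma_x=1.0, Sigma_y=2.0, sigma_y=1.0):
    """Return (x, y); x[i][j] and y[i][j] refer to block i and treatment j."""
    x, y = [], []
    for _ in range(r):
        theta = rng.gauss(0.0, Sigma_x)   # block component of x
        psi = rng.gauss(0.0, Sigma_y)     # block component of y not explained by x
        x_row, y_row = [], []
        for j in range(t):
            alpha = rng.gauss(0.0, sigma_x)   # intra-block component of x
            eps = rng.gauss(0.0, sigma_y)     # intra-block residual of y
            x_row.append(xi + theta + alpha)
            y_row.append(eta + tau[j] + psi + B * theta + beta * alpha + eps)
        x.append(x_row)
        y.append(y_row)
    return x, y


x, y = simulate_design_ii(r=5, t=4, tau=[2.0, 0.0, -1.0, -1.0], rng=random.Random(1))
print(len(x), len(x[0]))
```

Because the intra-block variates are independent and identically distributed, labelling the animals of a block by treatment in this way is equivalent to a random allocation within the block.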

A further assumption, that all $\theta_i$, $\alpha_{ij}$ are normally distributed, 
will be introduced. This is not essential to the main argument, but it 
enables the average variance of treatment means adjusted by an analysis 
of covariance to be simply expressed; as shown elsewhere (Finney, 1946; 
an approximation for the final average is now replaced by the exact 
value), multiplication of the residual variance about the regression by 
a factor [1 + 1/(error d.f. — 2)] gives a value for the effective variance 
per animal in comparisons between treatments. Without the assumption 
of normality, the corresponding expression must be written in terms of 
actual sums of squares of x for treatments and error, and is consequently 
more cumbrous, but qualitatively conclusions would be similar. No 
assumption that $\psi_i$, $\epsilon_{ij}$ are normally distributed need be made. 


4. RANDOMIZED BLOCK DESIGN (DESIGN II) 


Table 1 shows the expected mean squares and products in an analysis 
of variance and covariance. This follows a familiar pattern, except for 
the elaboration of separating the regression and residual components in 
the y analysis. 

TABLE 1 
EXPECTATIONS OF MEAN SQUARES AND PRODUCTS FOR 
RANDOMIZED BLOCK DESIGN (II) 

Variation     D.f.              (x²)                          (xy)                               (y²) 
Blocks        r - 1             $\sigma_x^2 + t\Sigma_x^2$    $\beta\sigma_x^2 + tB\Sigma_x^2$   $\beta^2\sigma_x^2 + tB^2\Sigma_x^2 + \sigma_y^2 + t\Sigma_y^2$ 
Treatments    t - 1             $\sigma_x^2$                  $\beta\sigma_x^2$                  $\beta^2\sigma_x^2 + \sigma_y^2 + rT^2$ 
Error         (r - 1)(t - 1)    $\sigma_x^2$                  $\beta\sigma_x^2$                  $\beta^2\sigma_x^2 + \sigma_y^2$ 


The mean squares of y for treatments and error are equal in their 
expectations except for the $T^2$ component in the former, and therefore 
the error mean square in an analysis of variance of this design leads to 
an unbiased estimate of the variance of the treatment means: 

$$V(\bar{y}_j) = \frac{1}{r}(\beta^2\sigma_x^2 + \sigma_y^2). \qquad (10)$$

Moreover, if an analysis of covariance is made according to the standard 
routine, using sums of squares and products for error and treatment-plus- 
error, the residual mean squares for treatment and error are again equal 
in expectation except for the $T^2$ component. Hence the residual error 
mean square leads to an unbiased estimate of the average variance of a 
treatment mean: 

$$V(\bar{y}_j') = \frac{\sigma_y^2}{r}\left[1 + \frac{1}{N - r - t - 1}\right]. \qquad (11)$$
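
As a numerical illustration of the two variances for design II, the sketch below assumes that the unadjusted variance is the error mean square expectation divided by r, and that the adjusted variance is the residual variance multiplied by the factor [1 + 1/(error d.f. - 2)] quoted in Section 3, with (r - 1)(t - 1) error degrees of freedom. The function names are hypothetical.

```python
def var_unadjusted_rb(beta, sigma_x2, sigma_y2, r):
    """Variance of an unadjusted treatment mean in randomized blocks."""
    return (beta ** 2 * sigma_x2 + sigma_y2) / r


def var_adjusted_rb(sigma_y2, r, t):
    """Average variance of a treatment mean adjusted for covariance on x."""
    error_df = (r - 1) * (t - 1)
    return sigma_y2 / r * (1.0 + 1.0 / (error_df - 2))


# Illustrative values: 4 treatments on 5 animals each, unit residual variance.
for beta2_sigma_x2 in (0.0, 0.05, 0.5):
    unadjusted = var_unadjusted_rb(1.0, beta2_sigma_x2, 1.0, r=5)
    adjusted = var_adjusted_rb(1.0, r=5, t=4)
    print(beta2_sigma_x2, round(unadjusted, 3), round(adjusted, 3))
```

With these illustrative figures the adjustment pays only when the regression component exceeds about one-tenth of the residual variance, which is the point taken up again in Section 8.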


5. COMPLETELY RANDOMIZED DESIGN (DESIGN I) 

If the same N animals as in the randomized block design were 
allocated r to each treatment entirely at random, the variance between the 
blocks would be added proportionately into sums of squares and products 
for treatments and error, and the expected mean squares and products 
would be as in Table 2, where 

$$h = t(r - 1)/(rt - 1). \qquad (12)$$

Again an unbiased estimate of variance is obtainable from the error 
mean square for y, and 

$$V(\bar{y}_j) = \frac{1}{r}\left[\beta^2\sigma_x^2 + \sigma_y^2 + h(B^2\Sigma_x^2 + \Sigma_y^2)\right]. \qquad (13)$$
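TABLE 2 
EXPECTATIONS OF MEAN SQUARES AND PRODUCTS FOR 
COMPLETELY RANDOMIZED DESIGN (I) 

Variation     D.f.        (x²)                          (xy)                               (y²) 
Treatments    t - 1       $\sigma_x^2 + h\Sigma_x^2$    $\beta\sigma_x^2 + hB\Sigma_x^2$   $\beta^2\sigma_x^2 + hB^2\Sigma_x^2 + \sigma_y^2 + h\Sigma_y^2 + rT^2$ 
Error         t(r - 1)    $\sigma_x^2 + h\Sigma_x^2$    $\beta\sigma_x^2 + hB\Sigma_x^2$   $\beta^2\sigma_x^2 + hB^2\Sigma_x^2 + \sigma_y^2 + h\Sigma_y^2$ 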

The estimation of error again remains unbiased under an analysis of 
covariance. The expected value of the average variance of a treatment 
mean is a complicated expression, though its estimate is obtainable by 
ordinary analysis of covariance processes for any actual records: 

$$V(\bar{y}_j') = \frac{1}{r}\left[\sigma_y^2 + h\Sigma_y^2 + \frac{h\sigma_x^2\Sigma_x^2(B - \beta)^2}{\sigma_x^2 + h\Sigma_x^2}\right]\left(1 + \frac{1}{N - t - 2}\right). \qquad (14)$$

Not only does it involve the y constituents of (13), in the manner that 
(11) is related to (10), but there is an additional item depending upon 
the difference between the two regression coefficients $B$ and $\beta$, which 
will therefore often be zero. 

If the animals for the experiment were selected independently at 
random from the infinite population, without regard to potential 
blocks, instead of being necessarily a set of N suitable for a randomized 
block design, Table 2 and equation (13) would be modified by writing 
h = 1. The change is slight when t and r are moderately large. 
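
The size of h, and its effect on the variance of an unadjusted treatment mean in the completely randomized design, can be tabulated quickly. The sketch below assumes h = t(r - 1)/(rt - 1) and a variance of the form of the error mean square expectation of Table 2 divided by r; the function names and numerical values are illustrative.

```python
def h_factor(r, t):
    """Proportion of the between-block variance entering the error line."""
    return t * (r - 1) / (r * t - 1)


def var_unadjusted_cr(beta, B, sigma_x2, Sigma_x2, sigma_y2, Sigma_y2, r, t):
    """Variance of an unadjusted treatment mean under complete randomization."""
    h = h_factor(r, t)
    return (beta ** 2 * sigma_x2 + sigma_y2
            + h * (B ** 2 * Sigma_x2 + Sigma_y2)) / r


for r, t in ((5, 4), (10, 4), (20, 10)):
    print(r, t, round(h_factor(r, t), 4))
```

For these sizes h rises from about 0.84 to about 0.95, so that replacing it by 1 alters the results only slightly.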


6. RANDOMIZED BLOCK DESIGN, BALANCED FOR TREATMENTS 
(DESIGN IV) 


If N animals are randomly selected and allocated to treatments with 
the dual constraint that they shall consist of r replicate blocks of t 
and that the t treatment means of x shall be equal, the analysis of 
covariance will be analogous to that in Table 1 but all constituents of 
the treatment line relating to x become zero. The total and block lines 
of the analysis are the same as if a true randomized block experiment 
were being conducted on the same animals, and consequently any 
reduction in the treatment line must be compensated by an increase in 
the error line, so keeping the sums of squares and products for treatment- 
plus-error unaltered. Table 3 shows the expected mean squares and 
products. 
TABLE 3 
EXPECTATIONS OF MEAN SQUARES AND PRODUCTS FOR 
RANDOMIZED BLOCK DESIGN BALANCED FOR TREATMENTS (IV) 

Variation     D.f.              (x²)                          (xy)                               (y²) 
Blocks        r - 1             $\sigma_x^2 + t\Sigma_x^2$    $\beta\sigma_x^2 + tB\Sigma_x^2$   $\beta^2\sigma_x^2 + tB^2\Sigma_x^2 + \sigma_y^2 + t\Sigma_y^2$ 
Treatments    t - 1             0                             0                                  $\sigma_y^2 + rT^2$ 
Error         (r - 1)(t - 1)    $r\sigma_x^2/(r - 1)$         $r\beta\sigma_x^2/(r - 1)$         $r\beta^2\sigma_x^2/(r - 1) + \sigma_y^2$ 


Doubtless many experimenters would consider that sufficient 
attention had been paid to x by balancing for this variate, and would 
therefore be content to make an analysis of variance on y alone. 
Comparison of the expected mean squares for treatment and error shows 
that their estimate of variance would be biased. The true variance of 
a treatment mean is 

$$V(\bar{y}_j) = \sigma_y^2/r,$$

all dependence on x having been eliminated by the balancing, but an 
estimate uncritically formed from the error mean square would have as 
its expectation 

$$\frac{1}{r}\left[\sigma_y^2 + \frac{r\beta^2\sigma_x^2}{r - 1}\right].$$

The more marked the intra-block regression of y on x, the more 
successful is the balancing but the more serious is the consequent 
over-estimation of error. The expectation of the biased estimate in fact 
exceeds the variance that is appropriate to design II. 

The fault is very easily remedied. If the routine procedure for a 
covariance analysis is completed by the adjustment of y for x, the 
expected mean square for treatments is unaltered and that for error 
becomes simply $\sigma_y^2$. Thus unbiased estimation is achieved. Snedecor 
[1956] mentions that H. L. Lucas has reached a similar result in an 
unpublished paper. Because of the balance, the adjusted treatment 
means are the same as the unadjusted and 

$$V(\bar{y}_j') = V(\bar{y}_j) = \sigma_y^2/r. \qquad (16)$$
7. SIMPLE BALANCE FOR TREATMENTS (DESIGN III) 


Table 4 is the modification of Table 2 required when the block 
classification of the N animals is ignored but treatment means of x 
are equalized; it bears to Table 2 the same relation that Table 3 does 
to Table 1. 

Again the bias in the estimation of treatment variances that arises 
if y alone is analysed is apparent. The true variance of a treatment mean 
is 

$$V(\bar{y}_j) = \frac{1}{r}\left[\sigma_y^2 + h\Sigma_y^2\right], \qquad (17)$$

whereas division of the error mean square in Table 4 by r would give an 
estimate whose expectation also involves $\sigma_x^2$ and $\Sigma_x^2$ and exceeds that 
in (13). Adjustment for covariance on x leaves treatment means 
unaltered, because of the balancing, and therefore 

$$V(\bar{y}_j') = V(\bar{y}_j) = \frac{1}{r}\left[\sigma_y^2 + h\Sigma_y^2\right]. \qquad (18)$$

The residual mean square for error still leads to a biased estimate of 
this quantity, the variance of the means estimated from it having 
expectation 


rt—1 
othe? 


E + (19) 
Hence variance estimation is biased unless either $\Sigma_x^2 = 0$ or $B = \beta$; 
the first is unlikely, since it would represent complete failure to stratify 
for x, but as already noted the second will be true in many experiments. 
Moreover, $\sigma_x^2$ will be relatively small in a successful stratification. 

If the N animals were selected independently at random from the 
infinite population, Table 4 and (18) and (19) would be modified by 
writing h = 1. 


8. COMPARISON OF DESIGNS 


When stratification based on x has been well arranged, $\sigma_x^2$, the 
variance within strata, will be very small. In a randomized block 
design, little might be gained by a covariance adjustment for x unless 
the intra-block correlation of x and y were so large as to make $\beta^2\sigma_x^2$ 
comparable in magnitude with $\sigma_y^2$; this can be seen from (10) and (11). 
Indeed, in rare instances, if $\sigma_x^2$ were negligible the adjustment might 
do more harm than good, since the elimination of $\beta^2\sigma_x^2$ from (10) might 
be more than balanced by the extra factor in (11) that represents the 
average effect of the sampling errors of the adjustments. If by accident 
stratification has been less satisfactory, a covariance adjustment can 
still remove most of the ill effects, especially if r and t are large. 

The inferiority of the completely randomized design scarcely requires 
comment, and is made clear by comparing (13) and (14) with (10) and 
(11). Without a covariance adjustment, the variance of treatment means 
is inflated by components of variance between the potential blocks. 
Even with adjustment, the variance contains a component depending 
upon such inter-block variance of y as is not associated with variations 
in x and a second component depending upon any difference between 
$B$ and $\beta$; the final factor in (14) is smaller than that in (11), but this 
will rarely suffice to compensate for the other differences. Except 
perhaps for very small experiments, where the additional degrees of 
freedom for estimating error may be especially valuable, the completely 
randomized design will usually be less efficient than the randomized 
block. Under the conditions most favourable for complete 
randomization, namely $\Sigma_y^2 = 0$ and $B = \beta$, the true precision of this design 
relative to that in randomized blocks is 

$$\frac{(N - t - 2)(N - r - t)}{(N - t - 1)(N - r - t - 1)},$$

a quantity only very slightly in excess of unity for any experiment 
with more than 20 animals. Making allowance for the further loss in 
precision when variances must be estimated from the experiment, by 
use of the formula (d.f. + 1)/(d.f. + 3) (Fisher [1951] §74) the efficiency 
becomes 

$$\frac{(N - t - 2)(N - r - t)(N - t)(N - r - t + 3)}{(N - t - 1)(N - r - t - 1)(N - t + 2)(N - r - t + 1)},$$

which is slightly greater than the previous expression, but still too 
small to favour the completely randomized design. For 4 treatments 
tested on 5 animals each, it is only 1.065, and the 6.5% advantage can 
easily be balanced by a small value of $\Sigma_y^2$; for 2 treatments tested on 
6 animals each, the value 1.38 gives more encouragement to complete 
randomization. 
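
The two numerical values quoted can be reproduced from the degrees of freedom alone. The sketch below assumes that, under the favourable conditions stated, each design has variance proportional to [1 + 1/(error d.f. - 2)], with error degrees of freedom N - t and (r - 1)(t - 1) respectively, and that Fisher's factor is applied to the residual degrees of freedom remaining after the regression has been fitted. The function names are illustrative.

```python
def relative_precision(r, t):
    """True precision of design I relative to design II when Sigma_y = 0 and B = beta."""
    n = r * t
    return (n - t - 2) * (n - r - t) / ((n - t - 1) * (n - r - t - 1))


def fisher_factor(df):
    """Allowance for a variance estimated on df degrees of freedom."""
    return (df + 1) / (df + 3)


def efficiency(r, t):
    """Relative precision after allowing for estimation of the variances."""
    n = r * t
    df_completely_randomized = n - t - 1
    df_randomized_blocks = n - r - t
    return (relative_precision(r, t)
            * fisher_factor(df_completely_randomized)
            / fisher_factor(df_randomized_blocks))


print(round(relative_precision(5, 4), 3), round(efficiency(5, 4), 3))
print(round(relative_precision(6, 2), 3), round(efficiency(6, 2), 3))
```

The two efficiencies come out at 1.065 and 1.38, the figures given in the text.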

The chief interest of this paper lies with the two balanced designs. 
The argument of Sections 6 and 7 has shown that the variance estimates 
will be seriously biased unless a covariance analysis of y on x is used. 
For randomized blocks with treatment balance, the true variance of 
means is slightly less than that for ordinary randomized blocks; 
comparison of (16) with (11) shows the precision to be increased by a factor 

$$\frac{N - r - t}{N - r - t - 1}.$$

In an experiment involving 20 animals, this may represent a gain of 
about 10%, and in an experiment on 40 animals about 4%. Similarly, 
for the completely randomized design the true variance of treatment 
means is smaller with treatment balance for x than without it, if $B$ 
and $\beta$ are equal, as is seen by comparing (18) and (14). The precision 
is increased by a factor 

$$\frac{N - t - 1}{N - t - 2},$$

so that the gain is rather less than before. For either balanced design, 
covariance analysis is essential to unbiased estimation of variances. 
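
The gains from balance quoted above depend only on N, r and t. As a rough check, assuming 4 treatments in each case (the text does not state the number), the two factors may be evaluated as follows; the function names are illustrative.

```python
def gain_balanced_blocks(r, t):
    """Precision of design IV relative to design II, both with covariance adjustment."""
    n = r * t
    return (n - r - t) / (n - r - t - 1)


def gain_simple_balance(r, t):
    """Precision of design III relative to design I when B = beta."""
    n = r * t
    return (n - t - 1) / (n - t - 2)


for r in (5, 10):
    print(4 * r, round(gain_balanced_blocks(r, 4), 3), round(gain_simple_balance(r, 4), 3))
```

With 20 and 40 animals the first factor is 1.10 and 1.04, and the second is smaller in each case.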
One more comparison relevant to current practice is that between 
designs II and III. To some experimenters, the attraction of a simply 
balanced design is an avoidance of the need for randomized blocks. 
Whether or not the variance of unadjusted treatment means favours 
the balanced design depends upon the relative magnitudes of $\Sigma_y^2$ and 
$\beta^2\sigma_x^2$, as may be seen by comparison of (17) with (10); there is no basis 
for asserting the general superiority of III. If means are adjusted for 
covariance with x, comparison of (18) with (11) shows that unless 
$\Sigma_y^2$ is very small the advantage lies with randomized blocks. Even 
this argument presents the balanced design too favourably, for it 
ignores the impossibility of estimating (17) unbiasedly from the results 
of the experiment except when $B = \beta$; when this condition is satisfied, 
a covariance analysis leads to unbiased estimation, but if a covariance 
analysis is to be undertaken in any case, it might as well be done on a 
randomized block design where the results are likely to be still more 
precise. 

Thus in terms of increase in precision of comparisons or decrease 
in variance of treatment means, the advantage for designs in which 
the treatments are deliberately equalized in respect of mean values 
of the initial measurement is small, except for experiments involving 
less than 20 animals in all. Moreover, the analysis here has assumed 
that perfect balance is attained; any inequality in the means of x 
for the treatment groups will reduce the precision to an intermediate 
value. Against any advantage must be set either the technical dif- 
ficulties of arranging the balance objectively or the theoretical objections 
to the balance usually achieved on the grounds of its subjective character. 
In the writer’s opinion, balance of this kind is very seldom worth the 
price that is paid for it, and he believes that most experimenters who 
are attracted by its apparent virtues would be wiser to abandon it. 
In special circumstances, the decision between balanced designs and 
orthodox randomized designs should perhaps rest with the experimenter, 
but he needs to be aware of the main conclusion from statistical theory 
that he cannot greatly increase his precision by adopting balance. 
The statistician may perhaps be forgiven for wondering why there 
should be anxiety to avoid a design that is as simple as randomized 
blocks, especially when the considerations advanced above show that, 
even under the most favourable circumstances, the advantage for a 
balanced design must be small and often it will be substantially inferior. 

A possibility of attaining a degree of balance in respect of x closer 
than will usually occur with randomized blocks, yet without the dangers 
of bias inseparable from designs III and IV, may finally be noted. If 
r is an integral multiple of t, the animals might first be allocated to r 
blocks as for randomized blocks, then arranged in descending order 
of x within each block, after which the treatments could be assigned 
in respect of blocks and order within blocks according to r/t randomly 
selected Latin squares. The standard Latin square analysis would then 
eliminate from the comparisons of treatment means for y any additional 
component of variation associated with order within blocks. Covariance 
analysis might still lead to further gain in precision. If r/t is not an 
integer, Youden squares and other incomplete Latin square schemes 
might be employed, at some considerable cost in respect of computa- 
tion-time. In practice, probably this Latin square constraint will 
seldom be very advantageous, but it has the merit of increasing pre- 
cision without a covariance analysis in the same circumstances that 
would make design IV more precise than simple randomized blocks. 
The loss of information arising from the fewer error degrees of freedom 
in the Latin square will scarcely be important except in very small 
experiments (e.g. a single 4 X 4 Latin square) in which the additional 
constraint is completely without effect. 
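
The Latin square scheme may be sketched as follows for the case in which r is a multiple of t. The squares here are obtained by permuting the rows, columns and symbols of a cyclic square, which is a simplifying assumption: it does not give every Latin square an equal chance of selection, as a strictly random choice would. The function names are hypothetical.

```python
import random


def random_latin_square(t, rng):
    """A t x t Latin square from random row, column and symbol permutations of a cyclic square."""
    rows, cols, symbols = list(range(t)), list(range(t)), list(range(t))
    rng.shuffle(rows)
    rng.shuffle(cols)
    rng.shuffle(symbols)
    return [[symbols[(rows[i] + cols[j]) % t] for j in range(t)] for i in range(t)]


def latin_square_allocation(blocks, rng):
    """Assign treatments by block and by rank of x within block.

    blocks is a list of r blocks, each a list of the t values of x; r must be
    a multiple of t. Returns, for each block, the treatment of each animal.
    """
    t = len(blocks[0])
    allocation = []
    for start in range(0, len(blocks), t):
        square = random_latin_square(t, rng)
        for row, block in zip(square, blocks[start:start + t]):
            order = sorted(range(t), key=lambda k: block[k], reverse=True)
            treatments = [None] * t
            for position, animal in enumerate(order):
                treatments[animal] = row[position]
            allocation.append(treatments)
    return allocation


blocks = [[35.0, 34.2, 33.1], [31.2, 30.8, 29.9], [28.4, 27.5, 26.0]]
print(latin_square_allocation(blocks, random.Random(7)))
```

Within each set of t blocks every treatment then occurs once at each rank of x, which is the source of the additional balance.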

9. SUMMARY 


Experimenters sometimes impose on the animals or other experi- 
mental units allocated to different treatments the condition that treat- 
ment means in respect of some preliminary measurement, believed to 
be correlated with the measurement that is to be used in assessment of 
the results, shall be approximately equal. Such balance may be in- 
corporated into completely randomized or randomized block designs. 
The expected variances of treatment means for these designs and for the 
corresponding randomized designs without balance have been studied 
and compared. Three major criticisms of the balanced designs emerge: 


(i) It is difficult to define an objective plan of allocation of units 
to treatments that will fulfill the requirements of balance yet 
allow the particular allocation adopted to be regarded as a 
random selection from a number of possibilities, and so give 
some hope that estimates of means and variances unbiased by 
subjective influences can be obtained. 


(ii) Even though the dangers of (i) be disregarded, uncritical 
analysis of a balanced design will seriously overestimate the var- 
iances of treatment means. By use of a covariance analysis with 
the initial measurement, all or most of this bias can be removed, 
but this essential step in the analysis of a balanced design 
destroys much of its advantage in respect of simplicity and 
speed of calculation. 


(iii) When (i) has been disregarded and the covariance calculation 
demanded by (ii) completed, there still remains the question 
as to whether any gain in precision has been achieved. The 
answer is that, except possibly for very small experiments, 
the gain is at best of the order of a few percent and is almost 
always too small to be worth setting against the risk of bias 
under (i) and the loss in precision that can occur if conditions 
are not optimal. More particularly, a randomized block 
design in which the blocks are based upon the known values of 
the initial measurement will almost certainly be more precise 
than a design in which blocks are abandoned in favour of 
balance. 


Thus the disadvantages of balance of this kind seem far to outweigh 
the advantages, and such designs should not be adopted unless special 
circumstances dictate their advisability. 

The possibility of using Latin square and allied designs as alternatives 
to the type of balance discussed has been briefly mentioned. When 
there is anything to be gained by balance, these designs avoid the 
theoretical objections and will usually have almost equal merits. 
THE ANALYSIS OF COVARIANCE AS A 
MISSING PLOT TECHNIQUE 
Irma Coons 
Westinghouse Electric Corporation, East Pittsburgh, Pa., U.S.A. 


INTRODUCTION 

The occurrence of missing data in a statistically designed experiment 
requires some modification of the routine technique of analysis. For 
certain simple designs, such as randomized blocks and Latin squares, 
formulae are readily available by which a single missing value may be 
estimated, and the single missing value procedure may be used itera- 
tively to estimate several missing values. The analysis of the augmented 
data may then be completed as usual after certain corrections for bias 
are made. One of these—a decrease in the degrees of freedom associated 
with total and with error sources of variation—is easily made. The 
other correction is necessitated by the fact that the treatment sum of 
squares, as calculated from the augmented data, is incorrect. In the 
case of a single missing value the treatment sum of squares is too large. 
It can be shown that this holds with any number of missing values. 
The general procedure for getting a correct treatment sum of squares 
can of course be applied. 

The work required to estimate the missing values and the resulting 
bias may become very tedious. Also, situations may arise for which 
no general formula is available. For instance, a fractionally replicated 
design does not always use the same effects to obtain an estimate of 
residual error. Since an estimate of a missing value is a function of 
the error term, a formula would have to be recalculated for each problem. 
Also, missing observations in split plot designs may cause problems. 
Some general method of handling missing observations in any situation 
is a needed tool. 

The purpose of this paper is to illustrate the full details of a method 
which can be used when one or more missing observations exist in an 
experiment of any statistical design. The advantages of this method 
are its generality of application and the ease with which exact tests of 
significance may be obtained. Here the word “exact” is used to mean 
exact when errors are normally and independently distributed. The 
technique employs the computational procedures of a covariance 
analysis using a dummy X covariate. Originally given in a paper by 
Bartlett [1937] and also described by Anderson [1946], the method 
does not seem to have been exploited to the full possible advantage and 
merits further description. 


GENERAL USE OF COVARIANCE TO DEAL WITH MISSING DATA 


In this section some properties will be listed which give the justifica- 
tion for the computational procedures to be described. Readers who are 
interested only in the computational application of the missing plot 
technique may omit this section. Some of the following statements are 
easily proved; others are more difficult except with matrix notation. 
The properties shown here serve to clarify some obscurities in missing 
plot techniques discussed by Norton [1955], Nelder [1954], and Smith 
[1957]. Property 1 is due to Fisher, property 2 has been implicitly 
assumed by many authors, property 3 is due to Bartlett [1937], prop- 
erties 4, 5, 6 have been obtained by Kempthorne (unpublished), but 
may be known to a number of workers. 

1. If an analysis of variance is made with symbols $\beta_1, \beta_2, \ldots, \beta_q$ 
in the place of missing observations, then the best linear unbiased 
estimates of the missing observations are the quantities 
$\beta_1, \beta_2, \ldots, \beta_q$ which minimize the error sum of squares. 

2. Given that, with full data $(y_1, y_2, \ldots, y_n)$, the best linear unbiased 
estimate of some linear function of the parameters is 
$\nu_1 y_1 + \nu_2 y_2 + \cdots + \nu_n y_n$, then the best estimate of that function 
with missing data is obtained by replacing the missing y's with the 
missing value estimates. 

3. Let the data be observed data where obtained and zero where 
missing. Introduce a concomitant variable $X_m$ $(m = 1, \ldots, q)$ 
corresponding to the mth missing observation; let $X_m$ take the value 
$-\nu$ for the mth missing observation and zero for all others, missing 
or not. If the error partial regression coefficients obtained from an 
analysis of covariance are denoted by $\hat\beta_1, \hat\beta_2, \ldots, \hat\beta_q$, then 
$\nu\hat\beta_1, \nu\hat\beta_2, \ldots, \nu\hat\beta_q$ are the best linear unbiased estimates of the 
missing observations. 

4. Estimates of functions of data with missing observations, and 
variances and covariances of these estimates may be obtained by the 
routine application of formulae for adjusted means in the analysis of 
covariance; i.e. by regarding the zero yields supplied in the analysis 
of covariance procedure as having variances of $\sigma^2$. The above statement 
applies to functions of the augmented data; the variance of a missing 
observation per se is given by statement 5 following. 

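5. Denote the error sum of squares of $X_i$ by $E_{ii}$ and the error sum 
of products of $X_i$ and $X_j$ by $E_{ij}$. Then the variance of the ith missing 
value estimate is $(\nu^2 u_{ii} - 1)\sigma^2$ and the covariance of the ith and jth 
missing value estimates is $\nu^2 u_{ij}\sigma^2$, where 

$$
\begin{pmatrix} E_{11} & \cdots & E_{1q} \\ \vdots & & \vdots \\ E_{q1} & \cdots & E_{qq} \end{pmatrix}
\begin{pmatrix} u_{11} & \cdots & u_{1q} \\ \vdots & & \vdots \\ u_{q1} & \cdots & u_{qq} \end{pmatrix}
=
\begin{pmatrix} 1 & & 0 \\ & \ddots & \\ 0 & & 1 \end{pmatrix}
$$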


6. The sum of squares for treatments obtained by analyzing the data 
augmented by the missing value estimates is always greater than or 
equal to the exact sum of squares for treatments. 
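
Statements 3 and 6 can be checked numerically on a small case. The sketch below is only an illustration under stated assumptions: the 3 x 4 randomized block data are invented, the covariate value is taken as -n (the choice made later in the paper), and the helper name `sums_of_products` is hypothetical. It compares the covariance estimate of the missing value with the classical randomized-block missing-plot formula, and the approximate treatment sum of squares (from the augmented data) with the exact one.

```python
# Sketch: one missing value in a randomized block layout, handled by a
# dummy covariate. All data are invented for illustration.

def sums_of_products(a, b):
    """Corrected sums of products of two tables (rows = treatments, columns = blocks)."""
    t, r = len(a), len(a[0])
    n = t * r
    correction = sum(map(sum, a)) * sum(map(sum, b)) / n
    total = sum(a[i][j] * b[i][j] for i in range(t) for j in range(r)) - correction
    treatments = sum(sum(a[i]) * sum(b[i]) for i in range(t)) / r - correction
    blocks = sum(
        sum(a[i][j] for i in range(t)) * sum(b[i][j] for i in range(t))
        for j in range(r)
    ) / t - correction
    return {"treatments": treatments, "blocks": blocks,
            "error": total - treatments - blocks}

t, r = 3, 4
n = t * r
mi, mj = 1, 2                      # position of the missing observation
y = [[12, 15, 14, 11],
     [18, 20, 0, 17],              # zero inserted for the missing value
     [22, 25, 24, 20]]
x = [[0] * r for _ in range(t)]
x[mi][mj] = -n                     # dummy covariate: -n in the missing cell

yy = sums_of_products(y, y)
xy = sums_of_products(x, y)
xx = sums_of_products(x, x)

beta = xy["error"] / xx["error"]   # error regression coefficient
estimate = n * beta                # missing value estimate (statement 3)

# Classical randomized-block missing-plot formula, for comparison.
G = sum(map(sum, y))
T = sum(y[mi])
B = sum(row[mj] for row in y)
classical = (r * B + t * T - G) / ((r - 1) * (t - 1))

# Exact treatment sum of squares from the covariance analysis.
adj_error = yy["error"] - xy["error"] ** 2 / xx["error"]
te_yy = yy["treatments"] + yy["error"]
te_xy = xy["treatments"] + xy["error"]
te_xx = xx["treatments"] + xx["error"]
exact = te_yy - te_xy ** 2 / te_xx - adj_error

# Approximate treatment sum of squares from the augmented data (statement 6).
aug = [row[:] for row in y]
aug[mi][mj] = estimate
approx = sums_of_products(aug, aug)["treatments"]

print(round(estimate, 4), round(classical, 4))
print(round(exact, 4), round(approx, 4))
```

In this invented case the two estimates agree and the approximate sum of squares exceeds the exact one, as statement 6 requires.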


APPLICATION OF COVARIANCE TECHNIQUE TO ONE 
MISSING OBSERVATION 


Let n equal the number of observations in the experiment including 
the missing one. Consider the original data as the Y variable of the 
covariance analysis and insert the value of zero in the cell which has 
the missing observation. Set up an X variable which takes the value 
of —n in the cell corresponding to the substituted zero value and the 
value of zero elsewhere. The choice of —n for the concomitant variable 
leads to the simple consequence that the sum of squares for any con- 
comitant variable is equal to n X (degrees of freedom), as described 
below. 
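
The consequence of choosing -n can be seen directly. The following sketch assumes an invented randomized block layout (3 treatments, 4 blocks) and shows that the corrected sum of squares of X for every source equals n times that source's degrees of freedom.

```python
# Sketch: with X = -n in one cell and zero elsewhere, the corrected sum of
# squares of X for each source of a randomized block layout is n x (d.f.).
t, r = 3, 4                      # invented layout: 3 treatments, 4 blocks
n = t * r
x = [[0] * r for _ in range(t)]
x[1][2] = -n                     # the cell of the missing observation

correction = sum(map(sum, x)) ** 2 / n
total = sum(v * v for row in x for v in row) - correction
treatments = sum(sum(row) ** 2 for row in x) / r - correction
blocks = sum(sum(row[j] for row in x) ** 2 for j in range(r)) / t - correction
error = total - treatments - blocks

ss = {"treatments": treatments, "blocks": blocks, "error": error, "total": total}
df = {"treatments": t - 1, "blocks": r - 1,
      "error": (t - 1) * (r - 1), "total": n - 1}
for source in ss:
    print(source, ss[source], n * df[source])
```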

The usual computational procedures of the covariance analysis 
automatically provide unbiased tests of significance. However, estimates 
of functions of the Y data, such as treatment means, must be adjusted 
to the value of zero for the concomitant variable rather than to the 
observed average value of X as is usually done. For example, an 
adjusted treatment mean is not estimated by 
$\bar{Y}_{\cdot i} - \hat\beta(\bar{X}_{\cdot i} - \bar{X}_{\cdot\cdot})$, 
the covariance formula generally used, but rather by the formula: 

adj $\bar{Y}_{\cdot i} = \bar{Y}_{\cdot i} - \hat\beta \bar{X}_{\cdot i}$    (1) 

where 

$\hat\beta = E_{xy}/E_{xx}$ 

and $E_{xy}$ and $E_{xx}$ are the error sum of products and sum of squares 
respectively, in the analysis of covariance. With a hierarchical 
classification, the adjustment is to the average of the next higher 
classification until the highest classification is reached, and the 
adjustment is then to zero, as shown above. 
The variance of (1) is given by: 

$V(\text{adj } \bar{Y}_{\cdot i}) = V(\bar{Y}_{\cdot i}) + (\bar{X}_{\cdot i})^2 V(\hat\beta)$    (2) 

where $V(\hat\beta)$ is taken to be $\sigma^2/E_{xx}$, and $\sigma^2$ is the true variance of any 
observation. This is estimated by replacing $\sigma^2$ by $s^2$, where $s^2$ is the 
residual error mean square resulting from routine application of the 
analysis of covariance. That is, 

$s^2 = \dfrac{E_{yy} - E_{xy}^2/E_{xx}}{\text{degrees of freedom}}$ 

where $E_{yy}$ is the covariance error sum of squares of the y's, and the 
degrees of freedom are equal to the error degrees of freedom with full 
data, less one. 

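Any comparison among the adjusted treatment means, such as: 

adj $(\bar{Y}_{\cdot 1} - \bar{Y}_{\cdot 2}) = (\bar{Y}_{\cdot 1} - \bar{Y}_{\cdot 2}) - \hat\beta(\bar{X}_{\cdot 1} - \bar{X}_{\cdot 2})$ 

may in general be designated by the notation 

$C_y - \hat\beta C_x$.    (3) 

The variance of such a comparison is: 

$V(C_y) + C_x^2 V(\hat\beta)$,    (4) 

where $V(C_y)$ is calculated assuming no missing observations and 
$V(\hat\beta)$ is taken to be $\sigma^2/E_{xx}$. 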
As noted in statement 3 above, 

Missing observation $= n\hat\beta = n\,E_{xy}/E_{xx}$. 

From statement 5 above, the variance of $n\hat\beta$ is 

$\sigma^2\left(\dfrac{n^2}{E_{xx}} - 1\right)$. 


However the complete analysis of the data may be performed without 
estimating the missing observation as such. 

The approximate test of significance for any source of variation, 
say T, is obtained by computing the biased sum of squares of T, as 

$T_{yy} - 2\hat\beta T_{xy} + \hat\beta^2 T_{xx}$, 

where $T_{yy}$, $T_{xy}$, and $T_{xx}$ are respectively the sum of squares for y, 
of products for x and y, and of squares for x, for the particular source T. 

Of course, when there is a missing value formula available for the 
particular case under consideration, this approximate sum of squares 
can also be obtained by estimating the missing value by this formula 
and then analyzing the augmented data. However, the virtue of the 
covariance technique is that only minor supplementary computations 
make the exact test readily obtainable as the ordinary unbiased analysis 
of covariance test. This is exemplified in the examples given. 

The additional computation required by the use of an X covariate 
is relatively slight, due to the simple nature of the X data. In most 
situations, the corrected sum of squares of X attributable to any given 
source of variation ($\sum x^2$) will simply be 

n × (degrees of freedom for the given source of variation). 


If, of course, one were analyzing the data by a high speed computing 
device, there would be no point in working out the partition of the sum 
of squares of x or products of x and y algebraically, since the cost of 
doing this on the device would generally be negligible. From this 
point of view, the procedure seems preferable as a universal missing 
plot procedure to that proposed by Hartley [1956]. 

It is also possibly worth mentioning that this covariance device 
is one extremely effective method for obtaining missing plot formulae. 
The reader may verify that the derivation of missing plot formulae for 
randomized blocks and Latin squares, for example, is entirely trivial by 
the covariance procedure. For incomplete block designs, the covariance 
procedure is relatively simple to apply, whereas a missing value formula 
could be complex. 
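
As an illustration of how short such a derivation is, here is a sketch for a randomized block experiment with t treatments and r blocks (n = rt). The symbols G, T and B are introduced only for this sketch: G is the grand total of the observed values, T the total of the treatment containing the missing value, and B the total of the block containing it. Y is zero and X is -n in the missing cell.

```latex
\begin{aligned}
\text{Total:}\quad \sum xy &= 0 - \frac{(-n)\,G}{n} = G,\\
\text{Treatments:}\quad \sum xy &= \frac{(-n)\,T}{r} + G = G - tT,\\
\text{Blocks:}\quad \sum xy &= \frac{(-n)\,B}{t} + G = G - rB,\\
E_{xy} &= G - (G - tT) - (G - rB) = tT + rB - G,\\
E_{xx} &= n\,(r-1)(t-1),\\
n\hat\beta &= \frac{n\,E_{xy}}{E_{xx}} = \frac{tT + rB - G}{(r-1)(t-1)}.
\end{aligned}
```

The last line is the familiar missing-plot formula for randomized blocks.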


EXAMPLES OF COVARIANCE APPLIED TO ONE 
MISSING OBSERVATION 


Fractionally Replicated Design of a $2^5 \times 4$ Factorial 


The data used to illustrate this type of design are a part of a larger 
experiment run at Westinghouse Electric Corporation. The experiment 
was designed to investigate the effects of various processing variables 
upon the corrosion properties of ZIRCALOY. The measurement used 
is weight gain per unit surface area, and the factors considered in Table 1 
are listed below. These treatments were run in factorial combination 
but only a 1/4 replicate was used. 


Position in Autoclave (A)          1. 2 (Lowest level tested) 
                                   2. 4 
                                   4. 8 
Control of vacuum in               1. Minimum leak rate 
  Annealing (B)                    2. Constant leak rate 
Cleaning treatment, prior          1. An alkaline cleaner 
  to pickling (C)                  2. A vapor degreaser 
Acidity of the first rinse,        1. pH 6-8 
  after pickling (D)               2. pH 2-3 
Method of the second rinse,        1. Sprayer 
  after pickling (E)               2. Hose 
Acidity of the third rinse (F)     1. pH 6-8 
                                   2. pH 4-5 


The missing observation is the treatment combination a3b.c.d,¢,f, , 
as shown in Table 1. The X values are not shown for lack of space, 
but they would simply consist of −32 = −n in the cell corresponding to 
the missing observation, and of zeros elsewhere. 

The covariance analysis of the data is given in Table 2, with all the 
computations relative to a given treatment listed on a single line. 
All factors are considered fixed, not random. 

The computations in this experiment may appear long, but are quite 
simple. The X analysis of column (3) may be written down at sight since 
$\sum x^2 = n \times$ (degrees of freedom). 

For the factors and interactions of column (2) with a single degree of 
freedom, the sum of products of x and y is simply 

$Y_1 - Y_2$ 

where $Y_1$ = total of Y observations for that level of the effect or 
interaction which does not contain the missing observation and $Y_2$ = total 
for that level which does contain the missing observation. Also for 
these single degree of freedom effects and interactions, the sum of 
squares for y is simply 

$(Y_1 - Y_2)^2/n$. 
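
These shortcuts for single degree of freedom effects can be confirmed on a toy case. The sketch below assumes an invented two-level effect in an experiment of n = 8 observations, with the missing cell in level 2, and computes the corrected sums from their definitions.

```python
# Sketch: a two-level effect in an experiment of n = 8 observations
# (invented numbers); the missing cell lies in level 2.
n = 8
level1_y = [5, 7, 6, 8]
level2_y = [9, 0, 4, 3]          # zero stands in for the missing value
level1_x = [0, 0, 0, 0]
level2_x = [0, -n, 0, 0]         # dummy covariate: -n in the missing cell

Y1, Y2 = sum(level1_y), sum(level2_y)
X1, X2 = sum(level1_x), sum(level2_x)
half = n // 2                    # observations per level

# Corrected sums for this source, computed from level totals.
sum_products = (X1 * Y1 + X2 * Y2) / half - (X1 + X2) * (Y1 + Y2) / n
ss_y = (Y1 ** 2 + Y2 ** 2) / half - (Y1 + Y2) ** 2 / n
ss_x = (X1 ** 2 + X2 ** 2) / half - (X1 + X2) ** 2 / n

print(sum_products, Y1 - Y2)
print(ss_y, (Y1 - Y2) ** 2 / n)
print(ss_x, n)
```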
Column (4) of Table 2 gives those approximate mean squares which 
would be obtained by an analysis of the data augmented by the 
estimated missing observation. This is equivalent to an analysis of 

$Y - \hat\beta X$ 

so the approximate sum of squares may be calculated as 

$T_{yy} - 2\hat\beta T_{xy} + \hat\beta^2 T_{xx}$. 

From statement 6, these approximate mean squares are known to be 
too large. Therefore any of these which are not significant may be 
eliminated from further consideration, and the calculation required in 
the analysis may thus be shortened. 
Since all factors are considered fixed, the residual variation is used 
as the error term throughout. Therefore the (Treatment + Error) 
columns are obtained by simply adding the same residual term to each 
significant treatment effect in the column; similarly the constant adjusted 
residual, 23.87, is subtracted from each line of column (8) to obtain 
adjusted treatment sum of squares. 

Thus the completed analysis is obtained and exact tests of significance 
have been made as shown. One table of means, that of the interaction 
between the effects of Type of Vacuum X Acidity of First Rinse (B X D) 
has been presented (Table 3) as an example of the procedure which 
must be followed to obtain the appropriate means and their variances. 


TABLE 3 
TABLE OF MEANS 
TYPE OF VACUUM × ACIDITY OF FIRST RINSE (B × D) INTERACTION 

Type of vacuum (B)        Acidity of the first rinse (D) 
                          pH 6-8        pH 2-3 
Minimum leak rate           32            31 
Constant leak rate          36            42 

Missing observation occurred under the Constant Leak 
Rate Vacuum at pH 2-3. 

Adjusted mean of that cell: 37.6 − (1.2)(−4) = 42 
$V(\text{Adj } \bar{Y}_{\cdot 2 \cdot 2 \cdot\cdot}) = s^2[1/8 + (-4)^2/416] = .33$ 
$V(\bar{Y})$ for the other cells $= s^2(1/8) = .25$ 

Split-plot Design With One Missing Observation 


Another phase of the ZIRCALOY experiment was concerned with 
the effects of production sources of variation upon corrosion of 
ZIRCALOY. The factors here considered are: 

Sponge producer (S)               1. Producer 1 
                                  2. Producer 2 
Time (days) (T)                   1. 14 
                                  2. 28 
                                  3. 42 
                                  4. 84 
Position in the autoclave (P)     1. Low level 
                                  2. High level 
Duplicate readings were taken on each treatment combination. The 
factors were run in a split-plot design with positions as the subplot. 
Data given in Table 4 are a function of the logarithms of the weight 
gain per unit area. The subplot taken as missing is associated with 
treatment combination f,s.p, . Again the X data are not shown but 
may be easily visualized. 

The covariance analysis is given in Table 5. The new feature of this 
design is the presence of more than one error term. Actually, two 
estimates of the missing observation are available: one ($n\hat\beta_s$) applicable 
to the individual subplot and one ($n\hat\beta_w$) to the whole plot which contains 
the missing subplot. This purely technical point may be worrisome to 
the non-statistician but a fully adjusted set of means should remove the 
difficulty. The procedure by which the mean of every whole plot and 
split-plot treatment combination may be written down is given by 
Kempthorne [1952, p. 388]. It is to be noted, however, that the 
adjustment is to a zero value of the concomitant. One finds that existent 
observations occurring in a whole plot with missing sub-plots are adjusted 
in such a way as to retain all comparisons given by the existent data. 

The analysis and tests of treatment effects proceed as usual for a 
split-plot covariance analysis. A linear corrosion rate per day, a main 
plot effect, is given by the contrast 

$(C_y - \hat\beta_w C_x)/196 = [953 - 16.7(8)]/196 = 4.18$    (From Equation 3) 

and the variance of this contrast is 

$\dfrac{1}{196^2}\left[V(C_y) + C_x^2 V(\hat\beta_w)\right] = 27$    (From Equation 4) 

The adjusted difference between high and low positions in the 
autoclave, a subplot effect, is estimated by 

$\bar{Y}_2 - \bar{Y}_1 - \hat\beta_s(\bar{X}_2 - \bar{X}_1) = [74 - 14.0(2)] = 46$    (From Equation 3) 

with a variance of 

$V(C_y) + C_x^2 V(\hat\beta_s) = 28$    (From Equation 4) 
APPLICATION OF COVARIANCE TECHNIQUE TO MORE 
THAN ONE MISSING OBSERVATION 


With more than one missing observation a multiple covariance 
analysis is required. Again let n equal the number of Y observations 
in the experiment including the missing ones. Assign a value of zero 
to those Y observations which are missing. Set up a concomitant 
variable $X_m$ for each missing observation. Each of these $X_m$ will 
consist of zero in all cells except in that cell corresponding to the missing 
observation with which the given $X_m$ is associated; in that one cell it 
will have a value of −n. 

With q missing observations, a multiple covariance analysis must 
be performed on Y and the q covariates $X_m$. An outline of a general 
multiple covariance analysis is given in Table 6 for a randomized block 
experiment with two missing observations (q = 2). The computations 
required to obtain the $\sum x_m^2$ and $\sum x_m y$ are simple, since each $X_m$ 
is associated with a single missing value and therefore has only one 
non-zero cell. In computing $\sum x_m x_{m'}$, two situations may be encountered. 

1. The two missing values associated with $X_m$ and $X_{m'}$ occur in the 
same level of the given source of variation. The results then are exactly 
the same as those obtained for $\sum x^2$; i.e. 

$\sum x_m x_{m'} = n \times$ (degrees of freedom for the given source of variation) 

in most cases. 

2. The two missing values occur in different levels of the given 
source of variation. Then, for most cases, 

$\sum x_m x_{m'} = -rn$, 

where r is dependent upon the hierarchical classification which is used. 
When no hierarchical classification is present, r = 1. 

When the given source of variation is an interaction effect, then the 
corresponding main effects and lower order interactions must be 
subtracted from the above $\sum x_m x_{m'}$. 
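
The two situations can be illustrated on a small case. The sketch below assumes an invented randomized block layout (3 treatments, 4 blocks, no hierarchical classification) with two missing values in the same treatment but in different blocks; the helper names `dummy` and `source_products` are hypothetical.

```python
# Sketch: cross products of two dummy covariates in a randomized block layout.
t, r = 3, 4                      # invented layout: 3 treatments, 4 blocks
n = t * r

def dummy(i, j):
    """Covariate that is -n in cell (i, j) and zero elsewhere."""
    x = [[0] * r for _ in range(t)]
    x[i][j] = -n
    return x

x1, x2 = dummy(1, 2), dummy(1, 0)   # same treatment, different blocks

def source_products(a, b):
    """Corrected sums of products of a and b for treatments and for blocks."""
    correction = sum(map(sum, a)) * sum(map(sum, b)) / n
    treatments = sum(sum(a[i]) * sum(b[i]) for i in range(t)) / r - correction
    blocks = sum(
        sum(a[i][j] for i in range(t)) * sum(b[i][j] for i in range(t))
        for j in range(r)
    ) / t - correction
    return treatments, blocks

treat_sp, block_sp = source_products(x1, x2)
print(treat_sp, n * (t - 1))     # same level: n x (d.f. for treatments)
print(block_sp, -n)              # different levels: -n in this layout
```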

A complete analysis of covariance will provide unbiased tests for 
treatment effects. Each missing observation $Y_m$ may be estimated by 
$n\hat\beta_m = n \times$ (error estimate of the $\beta$ associated with that missing 
observation). 

If q observations are missing, their estimation requires the solution 
of a set of simultaneous equations of size q × q in order to obtain 
estimates of the $\beta_m$, as shown in Table 6. Each unbiased test of adjusted 
treatment effects requires the solution of an additional set of 
simultaneous equations of size q × q. In general, the solution of these 
equations will be quite easy, even by an iterative technique, because usually 
the off-diagonal terms of the matrix of coefficients will be small. 
Even so, the amount of computation required may be reduced by 
performing an approximate analysis of variance on the Y data augmented 
by the missing value estimates $n\hat\beta_m$, as was done in Table 2 with a 
single missing observation. Since these approximate treatment sums 
of squares are known to be biased upward (see statement 6), any 
treatment effects which are not significant in this approximate analysis will 
be non-significant in an exact analysis; exact tests may be made on 
significant treatment effects if desired. With the above procedure the 
missing values are estimated exactly without tedious iteration, and the 
approximate analysis is available from the solution of a single q × q 
set of simultaneous equations. 

A virtue of the covariance technique whenever split-plots occur is 
that the various cases of missing split-plots in the same or different 
whole-plots are taken care of automatically. If, for instance, two missing 
split-plots occur in the same whole plot, the two corresponding 
concomitants will have perfect correlation for the whole-plot analysis. 
When they are thus identical in sets, only one from each set should be 
taken into account in finding the whole-plot error regression. In such 
a situation only the sum of the regression coefficients ($\hat\beta_m$) associated 
with these concomitants can be estimated. However, because of the 
perfect correspondence in whole plot totals, this sum is all that is 
needed to obtain adjusted Y values. For instance, with two missing 
split-plots in the (ij) whole-plot, the adjustment process in the simple 
split-plot situation for split-plot (ijk) is: 


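$y_{ijk} - \hat\beta_{1s}(x_{1ijk} - \bar{x}_{1ij\cdot}) - \hat\beta_{2s}(x_{2ijk} - \bar{x}_{2ij\cdot}) - \hat\beta_{1w}(\bar{x}_{1ij\cdot}) - \hat\beta_{2w}(\bar{x}_{2ij\cdot})$. 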


But $\bar{x}_{1ij\cdot}$ equals $\bar{x}_{2ij\cdot}$, so that we need only $(\hat\beta_{1w} + \hat\beta_{2w})$, the sum of 
the whole-plot coefficients, and the routine 
procedure provides this quantity. The above adjustment formula shows 
that all split-plot observations in the whole-plot containing missing 
values are subjected to an adjustment; however, this adjustment is 
such that the split-plot comparisons as given by the actual data are 
retained. 


EXAMPLES OF COVARIANCE APPLIED TO MORE THAN 
ONE MISSING OBSERVATION 

Split-Plot Design With Two Missing Subplots From Different Whole Plots 

These data are taken from a larger experiment investigating the 
effects of various production factors upon changes in energy loss as raw 
steel is made into transformers. The factors shown in Table 7 are: 

Coating used on the sheet of steel (A)        1. Glass 
                                              2. Carlite 
Location from which strip was taken (B)       1. Edge of sheet 
                                              2. Center of sheet 
Tension under which strips were wound         1. Low 
  into rings (C)                              2. High 

The factor A, replicated four times in each level, forms the main plot. 
The split plots are the four treatment combinations of factors B and C. 
The missing values are treatment combinations $a_1 b_2 c_1$ in replicate 4, 
and $a_2 b_1 c_1$ in replicate 3. For compactness the tables of $X_1$ and $X_2$ 
are condensed into a single table; actually each table should consist of 
a single −32 = −n in the appropriate cell and zero elsewhere. 

After obtaining estimates of the missing values, an approximate 
analysis was performed upon the data augmented by main plot estimates 


TABLE 7 

DATA FOR THE SPLIT-PLOT DESIGN WITH TWO MISSING OBSERVATIONS 

Coating used on the steel (A)          Glass               Carlite 
                                   R1  R2  R3  R4      R1  R2  R3  R4 

Location of         Tension                   Y Data 
origin of strips (B)  (C) 
Edge Low 75 42 75 | 00] 62 


High 70 Ot 68 50 59 66 106 102 


Center Low 122 91 102 39 50 74 «(34 
High 114 125 84 72 82 46 77 ~=—56 
X Data 
Zero in every cell except the two cells of the missing observations, 
each of which holds −32: Edge, Low under Carlite R3, and Center, Low 
under Glass R4. 
when computing main-plot effects A and Error (W), and by subplot 
estimates elsewhere (Table 8). In the split-plot design there are two 
approximate tests for whole-plot effects: one obtained solely from the 
main-plot analysis with data augmented by the main-plot estimates 
$n\hat\beta_{1w}, n\hat\beta_{2w}, \ldots$, and the second based on the analysis of the data 
augmented by the split-plot estimates $n\hat\beta_{1s}, n\hat\beta_{2s}, \ldots$. In order to 
obtain the correct subplot error term in the approximate analysis, 
subplot missing value estimates must be used in obtaining the main 
plot as well as subplot effects of the analysis. In Table 8, for instance, 
if the AB interaction effect were to be computed by subtracting A and 
B main effects from the raw corrected AB sum of squares, then the 
estimate of A which must be used is not 5832 as shown for the main 
plot analysis, but rather $(1215 + 84 - 970 - 86)^2/32 = 1847$, which 
is the A sum of squares using subplot missing value estimates. 
Therefore, the easiest method of obtaining the subplot error terms for the 
approximate analysis seems to be by the usual covariance formula 

$E_{yy} - \sum_m \hat\beta_m E_{x_m y}$ 

since the error terms in the covariance analysis and approximate analysis 
must be the same and since the $\hat\beta_m$ have already been calculated. 

In the approximate analysis of Table 8, the effects of A and AB are 
significant. Exact tests for these effects are then shown. In order to 
obtain the exact sum of squares for A, for instance, the following 
adjustment must be applied to Effect A + Error (W): 


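$-\sum_{m=1}^{2} \hat\beta_m \sum x_m y$ 

where the $\hat\beta_m$ must satisfy the equations (see Table 6): 

$224\hat\beta_1 - 32\hat\beta_2 = 761$ 
$-32\hat\beta_1 + 224\hat\beta_2 = 129$ 

Therefore 

$\hat\beta_1 = 3.55$ 
$\hat\beta_2 = 1.08$. 

Hence the adjusted [A + Error (W)] sum of squares is 

$\sum y^2 - \sum_{m=1}^{2} \hat\beta_m \sum x_m y = 7845 - 3.55(761) - 1.08(129) = 5004$. 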


TABLE 8 
ANALYSIS OF SPLIT-PLOT DESIGN WITH TWO MISSING OBSERVATIONS 
[Table body not legible in the source.] 


Subtraction of the correct error term, Error (W) = 628, from this gives 
the A sum of squares, 4376, which will provide an exact test for 
treatment effect A. 
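
The exact test for A can be retraced numerically. The sketch below rests on labeled assumptions: the printed normal equations are only partly legible, so the diagonal coefficient 224 = 32 × 7 (7 degrees of freedom for A plus Error (W)) and the off-diagonal coefficient −32 are inferred from the rules for dummy covariates given earlier, not read from the table. The right-hand sides 761 and 129, the total 7845 and the error term 628 are the figures quoted in the text.

```python
# Sketch: the 2 x 2 normal equations for the [A + Error (W)] line of Table 8.
n = 32
df = 7                      # ASSUMPTION: 1 d.f. for A plus 6 d.f. for Error (W)
sxx = n * df                # 224: sum of squares of each dummy covariate
sx1x2 = -n                  # -32: the missing values lie in different whole plots
sx1y, sx2y = 761, 129       # sums of products with y, as quoted in the text
syy = 7845                  # [A + Error (W)] sum of squares of y

# Solve  sxx*b1 + sx1x2*b2 = sx1y ;  sx1x2*b1 + sxx*b2 = sx2y  by Cramer's rule.
det = sxx * sxx - sx1x2 * sx1x2
b1 = (sx1y * sxx - sx1x2 * sx2y) / det
b2 = (sxx * sx2y - sx1x2 * sx1y) / det

# The paper works with the coefficients rounded to two decimals.
b1r, b2r = round(b1, 2), round(b2, 2)
adjusted = syy - b1r * sx1y - b2r * sx2y
a_ss = adjusted - 628       # subtract the adjusted Error (W)

print(b1r, b2r, round(adjusted), round(a_ss))
```

With these inferred coefficients the solution reproduces the quoted 3.55 and 1.08, and the adjusted sums of squares round to 5004 and 4376.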


The author is greatly indebted to O. Kempthorne for the basic 
theorems on which the technique is based and for suggestions on the 
preparation of this paper. 
416 A. AUGSBERGER (Nürnberg). Statistical Tests on Therapeutic 
Trials of Chronic Diseases, Performed on the Estimation of an End 
Value. 


Therapeutic trials in chronic diseases involve observations of 
quantitative (e.g. blood pressure in hypertension) or graded symptoms 
in the same patient during a first period (where there may be no therapy 
or placebo or drug A) and during a second period (drug B). The 
physician’s main question is whether the value of the symptom at 
the end of the second period is more favorable than one would expect 
had the first period continued unaltered. Fitting a curve to the observed 
values (or their logarithms, resp.) of each period (a straight line as a 
first approximation) the author examines the difference between the 
value of the symptom estimated by extrapolation of the first period on 
the last day of the second period and either the observed or the regression 
value of the second period on the same day. The standard error of the 
difference is obtained from the pooled error variance about the two 
regression lines. 


417 R. K. BAUER (Krefeld). Statistical Investigations with Automatic 
Computers. 


Automatic computers used for investigations on statistical data (a) 
reduce writing by hand to a minimum, and computing by hand or 
table machines to zero; (b) eliminate nearly all copying mistakes; 
(c) replace a slow staff of arithmeticians; (d) warrant high actuality of 
computing results by high speed of computing procedure; (e) render 
possible a more profound analysis of data. Therefore not only the 
largest, but all statistical investigations of an institute, a hospital or of 
industrial works may be done by automatic computers.—To get these 
advantages three conditions have to be fulfilled: (i) an efficient and 
reliable computing aggregate has to be on hand without delay; (ii) data 
have to be surveyed on punchable blanks, their documentation has to 
be made on punch cards; (iii) a library of universal computing programs 
for all standard statistical methods has to be compiled most quickly.— 
(i) was discussed; on (ii) some illustrations were shown; on (iii) a Bell- 
programmed total correlation and regression analysis by means of the 
IBM Magnetic-Drum-Data-Processing-Machine type 650 was 
demonstrated. 


418 H. GRIMM (Jena). Some Problems of Bacterial Distribution 
and Counting. 


1) In counting particles, time may be saved when a modified count- 
ing method is employed. The mean and standard error are estimated 
from the truncated Poisson. A little loss in precision is compensated 
by an essential saving of time. 

2) The type of distribution is readily recognized by plotting on 
Poisson cumulative probability paper. The various cumulative curves 
(Poisson, binomial, Neyman, negative binomial, log-series) become 
evident from this diagram, and their parameters may be estimated by 
means of transparent stencils. Some empirical compound distributions 
are discussed. 

3) Evidence is given of the difficulties involved in photoelectric 
counting. 


419 P. KUHNE (Berlin-Charlottenbg.). 


1) Provisional statistical estimation of the significance of differences 
between means reported without variance. 

In medical papers not uncommonly mean values are reported without 
simultaneously giving any information on the variance. By a confidence 
procedure—with N known—a limit can be estimated for the standard 
deviation beyond which a difference between means would have to be 
regarded as insignificant. A comparison with standard deviations, 
recorded elsewhere, of appropriate experimental values then permits 
inferences on differences of means unsatisfactorily reported. If the 
significance of a difference between the mean of a ‘“‘pathological’’ group 
and that of a normal one has to be tested, the ‘s’ limit of the pathological 
group can be isolated since the parameters of most clinically normal 
values can be found in tables. 

2) The use of the inverse sine transformation for adequate representa- 
tion of results expressed as ratios. 

Tests of significance of differences between series of ratios from the 
same experimental material by means of the inverse sine transformation 
can be reduced to a single calculation, since the standard deviation of 
the difference depends on the total number of observations only. An 
empirical solution for the necessary correction for small N by entering 
the t-distribution with $(N_1 + N_2)/2$ d.f. is given and demonstrated 
on a clinical example. Furthermore a solution is given for the problem 
of how many samples have to be taken for the detection of contamination 
of low incidence (bacterial contamination in industrial production of 
ampoulated remedies). 

3) A method for estimating the significance of rank correlations. 

In testing the correlation between continuous and coarsely grouped 
variates a “genuine”’ distribution for uncoordinated behavior depending 
solely on the total number can be calculated. This null-hypothesis of 
the difference of rank-numbers can be tested by chi-square against the 
sample distribution of experimental results. The calculable half- 
value of the distribution is chosen for chi-square grouping. 

4) The distribution of “pathological’’ values. 

Assuming the total population of any biological variate as distrib- 
uted normally the values of morbid individuals should be included 
within this total distribution as a tail of extremes. According to 
clinical usage those "pathological" values are separated from those of 
the "normal" population. This corresponds to a subtraction of two 
normal distributions differing in their $\sigma$. The resulting theoretical 
distribution is markedly asymmetrical and is compared with empirical 
distributions encountered in clinical research. It appears that only 
values collected under policlinical or ambulatory conditions conform 
to theory whereas ward groups tend to form a symmetrical distribution. 


420 W. LUDWIG (Heidelberg). On Some Partially Solved or Unsolved 
Problems in Biomathematics. 


A short review on problems attacked together with co-workers: 
(1) Animals belonging to a certain taxonomic group are consuming 
energy, by means of an organ I proportional to L* (L = body length), 
by means of organ II proportional to L*; what follows for the energy + 
L’—+relation in a log-log graph? Experimental verification by insect 
larvae and other animals.—(2) How many bacterial spores may z ml 
Agar culture-medium spread on a Petri dish of diameter y contain on the 
average, such that a culture of diameter z can be said to be originated 
from one single spore with a probability of 999 and over (WETTE)?— 
(3) Rate of progress of annidation (cf. LUDWIG 1950; WARTMANN 
unpubl.).—(4) Territory problems. I. Coins falling on a desk succes- 
sively; any coin overlapping one already deposited will be removed. 
Occupation density maximum?—(5) Mimicry: Proportion of indifferent 
animals to models to imitators; e.g. coral snakes of America. 
(6) Demonstration of lack of adequate statistical methods in certain 
"regression" problems in biology. 


421 G. OBERHOFFER (Bonn). On Some Principles of Therapeutic
Trials. (Construction of homogeneous material for comparative
trials.)


The therapeutic trial should comply with the requirements resultant 
from the standards of an exact experimentation without violating the 
patient’s faith. The therapeutic trial always has to be based on trial 
and control groups which have to be homogeneous. Discussion of 
some important principles set up by Martini for the performance of 
therapeutic trials: with acute diseases comparison of groups consisting 
of different patients, with chronic diseases comparison of different 
periods of the same patient. 


422 K. RIEGEL (Hamburg). On the Problem of Scaling and the 
Factor Analysis of Psychological Tests. 


So far test theory has yielded only little information on the problem 
of exact scaling methods as promoted in the field of experimental 
psychology, e.g. by Thurstone, Gulliksen, Guilford, Coombs, Stephens, 
and others. Generally three ways have been used to transfer exact 
scaling methods into test theory, which may be called (1) the “realistic”,
(2) the “instrumentalistic”, and (3) the method of “item validation”.
For demonstration purposes test data for a group of persons from the 
standardization sample of the Hamburg-Wechsler intelligence test were 
weighted anew in such a way as to express the sociological meaning of 
passing the items by new scores in order to improve the scaling base 
of the test. Necessarily, the reliability of the test and the subtests, 
and accordingly the usefulness of the test in a technical sense, were 
decreased by using the new scoring method. Correlations between new 
and old scores point out the danger of factor-analyzing test data of
a similar kind and emphasise the need to limit factor analysis to item
statistics. 


423 W. SEYFFERT (Berlin). A Method of Estimation of Interallelic
Interaction, a Contribution to the Gene Dosage Problem.


In estimating the valence of the single alleles of a pair of allelomorphs 
the measurements of the different diploid and tetraploid genotypes 
which have a constant genetic background consist of the gene dosage 
and the genome dosage. The influence of each of these contributions 
has been evaluated by the method of least squares, yielding three
equations of the eighth degree. Using Newton-Raphson’s iteration these
result in the following estimates: x = valence of the dominant allele,
y = valence of the recessive allele, z = valence of the genetic back- 
ground. (To appear in Z. Vererbgsl. 88; 1957.) 


424 F. STALLMAN (Giessen). Mathematical Notes on Electro- 
cardiography. 


The scope of electrocardiography is the inference of the distribution 
of electric elements in the heart from measurements of body surface 
potentials. By means of a theorem of Helmholtz, based on the symmetry
of the Green function, the problem can be reduced to the construction 
of presupposed fields of electric currents in the body interior. It can 
be shown that the elements of the heart constitute a single heart dipole, 
the position, direction, and magnitude of which can be determined 
from potential measurements. The mathematical problems concerning 
the development of this theory and methods of determination are 
outlined. (To appear in: Arch. Kreislaufforsch.) 


425 F. SULLWOLD (Göttingen). The Concept of “Simple-structure”
and the Problem of Factorial Invariance. 


It is demonstrated that factors extracted by the centroid method 
are to be considered as an arranging means only and that they are of 
no concrete value. The necessity is pointed out for a substitution 
of the voluntary original reference system. Discussion of the problem 
of correct or adequate position, respectively, of coordinates in view of 
the manifold of rotations, with regard to different criticisms concerning 
the same heuristic principle. 


426 E. WALTER (Göttingen). Some Simple Unbiased Nonparametric
Tests for Symmetry with Respect to Zero.


Given n differences $x_i = y_{1i} - y_{2i}$. A class of strictly unbiased
nonparametric rank order tests is considered for testing the hypothesis
$f(x) = f(-x)$, i.e. the symmetry of the probability density $f(x)$ with
respect to zero, under the assumption of absolute continuity of the 
distribution function F(x). This class contains amongst others (a) 
a modified sign test (in contrast to the sign test by R. A. Fisher the 
absolute smallest difference is discarded) (b) a test of size $\alpha = 2^{1-k}$,
for which the hypothesis $f(x) = f(-x)$ has to be rejected when the k
absolute largest differences have the same signs. This test is the opti- 
mum order test against distributions which are truncated but originally 
symmetrical with respect to zero. Furthermore, this test has higher
efficiency than the sign test when the distribution is normal and n small. 
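
The rejection rule of test (b) can be sketched in a few lines. This is an illustration only, not the author's procedure as published: the function names are hypothetical, and the size computation assumes the rule is applied two-sidedly (the k largest absolute differences all positive, or all negative). Under symmetry and continuity the signs then behave like independent fair coin flips, so the rejection probability is 2 · 2^(-k).

```python
def largest_k_same_sign(diffs, k):
    """Return True when the k differences largest in absolute value share one sign.

    Hypothetical helper: assumes no zero differences and no ties in absolute
    value, in line with the continuity assumption stated in the abstract.
    """
    if k < 1 or k > len(diffs):
        raise ValueError("k must satisfy 1 <= k <= len(diffs)")
    top = sorted(diffs, key=abs, reverse=True)[:k]
    return all(d > 0 for d in top) or all(d < 0 for d in top)


def size_of_test(k):
    """Null rejection probability of the two-sided rule: 2 * (1/2)**k."""
    return 2.0 ** (1 - k)


# Made-up differences, for illustration only.
sample = [0.3, -1.9, 2.4, 3.1, -0.2, 2.8, 4.0, -0.7]
reject_k4 = largest_k_same_sign(sample, 4)  # four largest: 4.0, 3.1, 2.8, 2.4
reject_k5 = largest_k_same_sign(sample, 5)  # fifth largest is -1.9
```

With the made-up data above, the hypothesis is rejected for k = 4 but not for k = 5.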


Papers presented at joint I.M.S. and Biometric Society (ENAR) Sessions,
Washington, March 7-9, 1957. 


427 VACLAV EDVARD BENES (Bell Telephone Laboratories,
Murray Hill, N. J., U.S.A.). The Joint Distribution of a Set of
Sufficient Statistics for the Parameters of a Simple Telephone
Exchange Model.


This paper considers a simple telephone exchange model which 
has an infinite number of trunks and in which the traffic depends on 
two parameters, the calling-rate and the mean holding-time. It is 
desired to estimate these parameters by observing the model con- 
tinuously during a finite interval, and noting the calling-time and 
hang-up time of each call, insofar as these times fall within the interval. 
It is shown that the resulting information may, for the purpose of 
this estimate, be reduced without loss to four statistics. These statistics 
are the number of calls found at the start of observation, the number of 
calls arriving during observation, the number of calls leaving during
observation, and the average number of calls existing during the interval 
of observation. The joint distribution of these sufficient statistics is 
determined (in principle) by deriving a generating function for it. 
From this generating function the means, variances, covariances, and 
correlation coefficients are obtained. Various estimators for the
parameters of the model are compared, and some of their distributions,
means, and variances presented. 
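
The four statistics named above can be computed directly from a log of calls. The sketch below is only an illustration under stated assumptions: the record layout (a list of `(start, stop)` pairs measured from the start of the observation window, with negative starts for calls already in progress) and the function name are hypothetical and do not come from the paper.

```python
def exchange_statistics(calls, t_end):
    """Reduce call records to the four statistics for the window [0, t_end].

    calls: iterable of (start, stop) times; stop may be None for a call that
    has not yet ended. Returns (calls in progress at time 0, arrivals during
    the window, departures during the window, time-average number of calls).
    """
    in_progress_at_start = 0
    arrivals = 0
    departures = 0
    occupied_time = 0.0
    for start, stop in calls:
        if stop is None:
            stop = float("inf")
        if stop <= 0 or start >= t_end:
            continue  # the call never overlaps the observation window
        if start <= 0:
            in_progress_at_start += 1
        else:
            arrivals += 1
        if stop <= t_end:
            departures += 1
        # Only the part of the call inside the window contributes.
        occupied_time += min(stop, t_end) - max(start, 0.0)
    return in_progress_at_start, arrivals, departures, occupied_time / t_end


# Made-up call log, for illustration only.
log = [(-2.0, 1.0), (-1.0, 12.0), (0.5, 3.5), (4.0, 6.0), (8.0, 15.0)]
stats = exchange_statistics(log, 10.0)
```

The fourth statistic is obtained as the total call-time inside the window divided by the window length, which is one natural reading of "average number of calls existing during the interval".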


428 R. C. BOSE (Department of Statistics, University of North
Carolina, Chapel Hill, N. C., U.S.A.). On a Problem in Abelian
Groups and the Construction of Fractionally Replicated Designs.


Consider an Abelian group of order $s^n$, generated by n letters
$A_1, A_2, \ldots, A_n$ with the relations $A_1^s = A_2^s = \cdots = A_n^s = I$, where $I$ is
the identity and $s$ is a prime. If $G = A_1^{x_1} A_2^{x_2} \cdots A_n^{x_n}$ is any element
of the group, then the number of non-zero exponents $x_i$ may be called
the length of $G$. Given an integer $r < n$, the problem is to find a
subgroup of order $s^r$, generated by r independent elements $G_i =
A_1^{x_{i1}} A_2^{x_{i2}} \cdots A_n^{x_{in}}$ such that the minimum length of the elements in
the subgroup (except the length of the unit element) is greater than or
equal to k. Consider the finite projective space $PG(r - 1, s)$. To
any point $x = (x_1, x_2, \ldots, x_r)$ of this space, assign a non-negative
integer m, which may be considered the measure of x, in such a way
that the total measure for the space is n. To a point of measure m
associate m different letters chosen out of $A_1, A_2, \ldots, A_n$, each of
these letters being assigned to one and only one point. Let $G_i =
A_1^{x_{i1}} A_2^{x_{i2}} \cdots A_n^{x_{in}}$ where $x_{ij}$ is the ith coordinate of the point to which
$A_j$ is associated. It is proved that the length of the element $G_1^{t_1} G_2^{t_2} \cdots G_r^{t_r}$
is the measure of the set of points not lying on the linear space
$t_1 x_1 + \cdots + t_r x_r = 0$. For example let $n = 10$, $r = 4$, $s = 3$.
We can find exactly 10 points on an unruled quadric in $PG(3, 3)$.
Any plane meets this
quadric in four points, or in one point. If we assign the measure one
to each point of the quadric and associate with it one of the letters
$A_j$ we get the generators $G_1, G_2, G_3, G_4$. The length of the words
(other than the identity) of the subgroup generated by $G_1, G_2, G_3, G_4$
must be either 6 or 9. If we take this subgroup as the fundamental
identity for generating a $1/3^4$ fraction in a factorial design with 10
factors, then all the aliases of a main effect will have five or more factors,
and all the aliases of a two factor interaction will have four or more
factors.
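
The worked example can be checked by brute force. The sketch below is illustrative only. As an assumption, it uses the form $x_1 x_2 + x_3^2 + x_4^2 = 0$ over GF(3) as a stand-in unruled (elliptic) quadric; this is not necessarily the equation used in the paper. All function names are hypothetical.

```python
from itertools import product

S = 3  # the prime s of the example


def on_quadric(x):
    """Assumed stand-in quadric x1*x2 + x3^2 + x4^2 = 0 over GF(3)."""
    x1, x2, x3, x4 = x
    return (x1 * x2 + x3 * x3 + x4 * x4) % S == 0


def projective_points(dim, s):
    """One representative per point of PG(dim - 1, s): first non-zero coordinate is 1."""
    points = []
    for v in product(range(s), repeat=dim):
        nonzero = [c for c in v if c != 0]
        if nonzero and nonzero[0] == 1:
            points.append(v)
    return points


# Each quadric point carries measure one, i.e. exactly one letter A_j.
quadric_points = [p for p in projective_points(4, S) if on_quadric(p)]


def word_length(t, points):
    """Length of G_1^t1 ... G_4^t4: number of points NOT on the plane t . x = 0."""
    return sum(1 for p in points if sum(a * b for a, b in zip(t, p)) % S != 0)


# Lengths of all non-identity words of the subgroup generated by G_1, ..., G_4.
lengths = sorted({word_length(t, quadric_points)
                  for t in product(range(S), repeat=4) if any(t)})
```

Under this assumed quadric the enumeration finds 10 points and word lengths of 6 and 9 only, in agreement with the plane sections of four points or one point described above.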


429 BRADLEY BUCHER (Princeton University, N. J., U.S.A.). 
The Recovery of Intervariety Information. 


Assume, in the incomplete block model, $y_{ij} = m + b_i + t_j + e_{ij}$,
that the block effects $b_i$ are independently distributed with mean 0 and
a common variance, the error terms $e_{ij}$ are independently distributed with
mean 0 and variance $\sigma^2$, and that the variety effects $t_1, \ldots, t_k$ are
fixed effects and that $t_{k+1}, \ldots, t_v$ are independently distributed with
mean 0 and variance $\gamma^2$. Then in estimating any linear combination
of the variety effects, say $a_1 t_1 + a_2 t_2 + \cdots + a_k t_k$, we may make
use of information among the varieties $t_{k+1}, \ldots, t_v$. Minimum
variance linear unbiased estimates are obtained for such combinations
for a large class of incomplete block designs. In general, these esti- 
mates have smaller variance than analogous estimates obtained using 
only inter- and intra-block recovery. For balanced incomplete blocks 
the estimate with inter-variety recovery is shown to be the same as 
the combined intra- and inter-block estimate. Several techniques are 
developed which are useful for finding estimates using intervariety 
recovery. The problem of estimating $\gamma^2$ is discussed. Useful
applications of the technique of intervariety recovery are considered.


430 J. T. CHU AND F. C. LEONE AND C. W. TOPP (Case Institute
of Technology and Fenn College, Ohio, U.S.A.). Some Uses of
Quasi-Ranges II.


In “Some uses of quasi-ranges” (Ann. Math. Stat. 28, 1) methods
are given of using quasi-ranges to obtain confidence intervals for, and
tests of hypotheses about, some measures of dispersion of a given 
distribution, (such as the interquantile distance and the standard 
deviation). In this paper, further research is done on the selection of 
quasi-ranges for making inferences about the standard deviations of 
the normal, rectangular, and exponential distributions. The methods 
are also extended to the coefficient of variation, the difference and ratio 
of interquantile distances and standard deviations of two given distri- 
butions, etc. Tables are given to facilitate applications. 
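
For readers unfamiliar with the term, a quasi-range can be computed as below. The abstract does not define it, so this sketch assumes the usual definition: the r-th quasi-range of a sample of size n is the difference between the (n - r)-th and (r + 1)-th order statistics, with r = 0 giving the ordinary range. The function name is hypothetical.

```python
def quasi_range(sample, r):
    """r-th quasi-range: x_(n-r) - x_(r+1) in ascending order statistics."""
    xs = sorted(sample)
    n = len(xs)
    if r < 0 or 2 * r >= n:
        raise ValueError("need 0 <= r and 2*r < n")
    # 1-indexed x_(n-r) and x_(r+1) are xs[n-1-r] and xs[r] in 0-indexed terms.
    return xs[n - 1 - r] - xs[r]


# Made-up data, for illustration only.
data = [7, 3, 10, 5, 6, 4, 8]
w0 = quasi_range(data, 0)  # ordinary range
w1 = quasi_range(data, 1)  # drops one observation from each end
w2 = quasi_range(data, 2)  # drops two observations from each end
```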


431 A. BRUCE CLARKE (University of Michigan, Ann Arbor,
Michigan, U.S.A.). Maximum Likelihood Estimates in a Simple
Queue.


A simple stationary queuing process is a queuing process having a
Poisson input (with parameter $\lambda$), and a negative exponential service
time (with mean $1/\mu$, $\mu > \lambda$). Let $\nu$ = the initial queue size, $x_i$ =
the time of the ith arrival, $y_i$ = the “busy time” up to the ith departure.
The sequences $\{x_i\}$ and $\{y_i\}$ then represent the transition times of
independent Poisson processes (parameters $\lambda$ and $\mu$), and $\{\nu\}$, $\{x_i\}$, $\{y_i\}$
together characterize the process. By observing the process for a
fixed “busy time” $\tau$ and using the above comment, maximum likelihood
estimates for $\lambda$ and $\mu$ may be obtained in terms of $\nu$, $m$ = the total
number of departures, $T$ = the time of the mth departure, and $n$ =
the total number of arrivals up to time $T$. Under certain conditions
these estimates of $\lambda$ and $\mu$ may be approximated by $(n + \nu)/T$ and
$(m - \nu)/\tau$.
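
The closing approximations can be written as a small helper. This is a sketch only: it takes the two approximate forms to be (n + ν)/T and (m − ν)/τ, which is an assumed reading of the abstract, and the function and argument names are hypothetical. It does not compute the exact maximum likelihood estimates, and it does not check the "certain conditions" under which the approximation is said to hold.

```python
def approximate_queue_estimates(nu, n, m, T, tau):
    """Approximate estimates of the arrival rate and the service rate.

    nu: initial queue size; n: arrivals up to time T; m: departures;
    T: clock time of the m-th departure; tau: fixed busy time observed.
    """
    if T <= 0 or tau <= 0:
        raise ValueError("T and tau must be positive")
    lam_hat = (n + nu) / T
    mu_hat = (m - nu) / tau
    return lam_hat, mu_hat


# Made-up observation summary, for illustration only.
lam_hat, mu_hat = approximate_queue_estimates(nu=2, n=48, m=45, T=100.0, tau=60.0)
```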


432 EARL DIAMOND (University of North Carolina, U.S.A.).
Extension of Some Results Given by Mitra on “Statistical
Analysis of Categorical Data”.


This is a follow up of two previous papers [(1) “Some non-parametric
generalizations of analysis of variance and multivariate analysis”
by S. N. Roy and S. K. Mitra, Biometrika, December, 1956 and (2)
“Contributions to the statistical analysis of categorical data” by S. K.
Mitra, North Carolina Institute of Statistics Mimeograph Series
No. 142]. We start from a product of multinomial distributions in which
the range of each subscript is a subset depending on the preceding
subscripts. We next consider two hypotheses, $H_0$: $p = f(\theta_1, \ldots, \theta_t)$
subject to $g_m(\theta_1, \ldots, \theta_t) = 0$ $(m = 1, 2, \ldots, u < t)$, and
$H_0'$: $p = f'(\theta'_1, \ldots, \theta'_{t'})$ subject to $g'_m(\theta'_1, \ldots, \theta'_{t'}) = 0$
$(m = 1, 2, \ldots, u' < t')$, where $p$ denotes the cell probabilities and
$t, t' <$ (total number of cells) $-$ (total number of multinomial distributions).
Each hypothesis is a composite one in which the $\theta$ or $\theta'$ are the
nuisance parameters and $f$, $f'$, $g_m$ and $g'_m$ are known functions. Tests are taken over from
references (1) and (2), and the asymptotic powers of the tests and the 
conditions for asymptotic independence are derived which are exten- 
sions of similar conditions for more special cases discussed in (2). 


433 JOHN J. GART (Virginia Polytechnic Institute, Blacksburg, 
Virginia, U.S.A.). An Extension of the Cramér-Rao Inequality. 


Consider a frequency function $f(x \mid \theta)$ where $\theta = (\theta_1, \theta_2, \ldots, \theta_s)$,
the function being specified when $\theta$ is specified. The parameter $\theta$
has a density $g(\theta)$ independent of $x$. Let $X = (x_1, x_2, \ldots, x_n)$ be a
random sample from a randomly chosen population having the specified
frequency function. Then if $\phi = \prod_{i=1}^{n} f(x_i \mid \theta)$ and $t_k$ (independent of
$\theta$) is an estimate of $\theta_k$, $1 \le k \le s$, there follows a form similar to the
Cramér-Rao Inequality,


EE((t, — 6.)° | 6] > {E[E(t. | ®) — 


The equality is reached if and only if $t_k$ is an unbiased sufficient statistic
having the normal distribution with constant variance. In this case
the equality holds regardless of the form of $g(\theta)$.


434 SHANTI S. GUPTA AND MILTON SOBEL (Bell Telephone
Laboratories, Allentown, Pennsylvania, U.S.A.). On Selecting a
Subset which Contains All Populations Better Than a Standard.


Populations $\Pi_i$ $(i = 0, 1, \ldots, p)$ are given with a common
Koopman-Darmois distribution of known form differing only in the value of
the unknown parameter $\tau_i$ $(i = 1, 2, \ldots, p)$; cases of known and
unknown $\tau_0$ (associated with the standard $\Pi_0$) are treated separately.
Location and scale parameter problems are both treated. In some
problems $\Pi_i$ is defined as better than $\Pi_0$ if $\tau_i > \tau_0$; in others if $\tau_i < \tau_0$.
A procedure is given in each case for selecting a small subset so that,
for any true configuration, the probability of including all $\Pi_i$ equal to
or better than $\Pi_0$ is at least $P^*$, $P^* < 1$ being preassigned. For the
location parameter, with $\tau_0$ unknown, the procedure is to retain all $\Pi_i$
with $\bar{w}_i = \frac{1}{n_i}\sum_{j=1}^{n_i} w(x_{ij}) \ge \bar{w}_0 - d/\sqrt{n_i}$; here $\bar{w}_i$ is sufficient for
$\tau_i$ $(i = 0, 1, \ldots, p)$. For scale parameter problems, with smaller $\tau_i$ more
preferable, the procedure retains all $\Pi_i$ with $\sum_{j=1}^{n} w(x_{ij}) \le (1 + d)$
$\sum_{j=1}^{n} w(x_{0j})$. In several problems the value of d is computed and tables
are given for different P* and p-values; in others transformations are 
used to “normalize”? the problem. The normal and chi-square distri- 
butions are used as applications. Problems involving binomial and 
Poisson distributions are treated separately with and without normalizing 
transformations. 


435 JOHN GURLAND (Iowa State College, Ames, Iowa, U.S.A.). 
Generalized Quantal Response in Biological Assay. 


The quantal (all-or-none) response in biological assay refers to a 
response in which one of two possible outcomes occurs. In a bioassay 
such as that of an insecticide based on mortality of the housefly, say, 
there are however three possible outcomes, namely alive, moribund, 
dead. The present paper considers a generalized quantal response in 
which two or more outcomes are possible. Whether one uses normits 
(cf. probits) or logits or other transformations, a general method of
analyzing the data is developed which makes explicit use of all the 
possible outcomes and hence is more efficient than the common pro- 
cedure of pooling some outcomes (for example moribund and dead) 
in order to make the response all or none. Further, a technique anal- 
ogous to that used in discriminant functions is suggested as a method 
which makes more efficient use of the data than the pooling method 
mentioned above. 


436 ANDIE LUBIN (Walter Reed Army Institute of Research,
Washington, D. C., U.S.A.). A Rank Order Test for Trend in
Correlated Means.


In many experiments the major interest is not in the amount of 
difference caused by the treatments but the rank-order which results. 
This is especially true when successive measurements are made on the 
same subject, and the “treatments” are simply varying amounts of 
fatigue, sleep loss, etc., i.e., some function of time. For such studies the
null hypothesis is that no trend exists and generally the only alternative 
hypothesis is a rank-order that can be specified by the experimenter. 

A. R. Jonckheere (1954, A distribution-free k-sample test against
alternatives, Biometrika 41) has used Kendall’s tau to obtain a general
statistic, P, for testing the agreement between a hypothesized
rank-order for n objects or scores and a set of observed rankings of the n
scores by m judges. From this general approach, he derives a test for
trend as a special case.

As an alternative to Jonckheere’s P, a statistic J based on
Spearman’s $S(d^2)$ is examined. It is the sum of the $S(d^2)$ values computed
between the m observed rankings of the n scores and the hypothesized
rank-order of the n scores. K, the average rank order correlation between
the m rankings and the hypothesized rank-order, is a simple algebraic
function of J.

It is shown that J is slightly more sensitive than Jonckheere’s P 
statistic for small values of n, but that P tends to normality faster 
than J. 
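
The statistics J and K can be sketched as follows. This is an illustration under stated assumptions rather than the author's published computation: rankings are taken to be untied permutations of 1, ..., n, K is derived from the standard Spearman identity ρ = 1 − 6 S(d²)/(n(n² − 1)) averaged over the m rankings, and the function names are hypothetical.

```python
def sum_squared_rank_differences(ranking, hypothesized):
    """Spearman's S(d^2) between one observed ranking and the hypothesized order."""
    return sum((a - b) ** 2 for a, b in zip(ranking, hypothesized))


def trend_statistics(rankings, hypothesized):
    """Return (J, K): J sums S(d^2) over the m rankings, K is the mean rank correlation."""
    n = len(hypothesized)
    m = len(rankings)
    J = sum(sum_squared_rank_differences(r, hypothesized) for r in rankings)
    # Average of 1 - 6*S/(n(n^2 - 1)) over the m rankings (no ties assumed).
    K = 1.0 - 6.0 * J / (m * n * (n * n - 1))
    return J, K


# Made-up rankings of n = 4 scores by m = 3 subjects, for illustration only.
hypothesized_order = [1, 2, 3, 4]
observed = [[1, 2, 3, 4], [2, 1, 3, 4], [1, 3, 2, 4]]
J, K = trend_statistics(observed, hypothesized_order)
```

Because K is a decreasing linear function of J for fixed m and n, a test based on one is equivalent to a test based on the other.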


437 J. A. McFADDEN (U.S. Naval Ordnance Lab., Silver Spring, 
Maryland, U.S.A.). The Variance of Zero-Crossing Intervals. 


Two expressions are given for the variance of the intervals between 
successive zeros of a random process. It is assumed that the successive 
intervals form a Markoff chain. If $x(t)$ is a random process, let $y(t) = 1$
when $x(t) > 0$ and $y(t) = -1$ when $x(t) < 0$. Let $\beta$ be the expected
number of zeros per second and let $\kappa$ be the correlation coefficient
between two successive zero-crossing intervals. Then the variance is
$\sigma^2 = (2A/\beta)(1 + \kappa)/(1 - \kappa)$, or alternatively, $\sigma^2 = [(1 + 2B)/\beta^2]$
$(1 - \kappa)/(1 + \kappa)$ where $A = \int_0^\infty r(\tau)\,d\tau$ and $B = \int_0^\infty [Q(\tau) - \beta]\,d\tau$.
$r(\tau)$ is the autocorrelation function of the process $y(t)$ and $Q(\tau)\,d\tau$ is
the conditional probability of a zero between $t + \tau$ and $t + \tau + d\tau$,
given a zero at time $t$.


438 JUNJIRO OGAWA (University of North Carolina, North
Carolina, U.S.A.). A Limit Theorem of Cramér and Its
Generalization.


As a generalization of Doob’s theorem, H. Cramér states the following.
Theorem: Suppose we have for every $\nu = 1, 2, \ldots$, $y_\nu = A x_\nu + z_\nu$,
where $x_\nu$, $y_\nu$ and $z_\nu$ are n-dimensional random variables, while $A$ is a
matrix of order $n \times n$ with constant elements. Suppose further that
as $\nu \to \infty$, the n-dimensional distribution of $x_\nu$ tends to a certain limiting
distribution, while $z_\nu$ converges in probability to zero. Then $y_\nu$ has
the limiting distribution defined by the linear transformation $y = Ax$
where $x$ has the limiting distribution of the $x_\nu$ (H. Cramér: Mathematical
Methods of Statistics, Princeton, 1946, pp. 299-300). Cramér skips the 
proof of this theorem. In this paper, the complete proof of this theorem 
will be given and two theorems which are generalizations of this theorem 
and are useful in statistics will be proved. 


439 JUNJIRO OGAWA (University of North Carolina, North
Carolina). On the Mathematical Principles Underlying the
Theory of the $\chi^2$ Test.


The rigorous proof of the theorem that the $\chi^2$ statistic has the limiting
chi-square distribution with degrees of freedom reduced by the number 
of the independent parameters which were estimated from the frequency 
data, was first given by H. Cramér in his famous book Mathematical 
Methods of Statistics, Princeton [1946], but some steps of the proof were 
skipped. Later on S. N. Roy and S. K. Mitra (Biometrika, 43 [1956])
and S. K. Mitra (Thesis, Univ. of N. C., 1956) reasoned along the same
lines and got theorems adjusted to various physical situations. The 
purposes of this paper are to present a complete and self-contained 
proof of Cramér’s theorem on the one hand, and on the other to explain 
how the proof of the related theorems got by S. N. Roy and S. K. Mitra
could be thrown back on that of Cramér’s theorem from the mathematical 
point of view. 


440 JUNJIRO OGAWA (University of North Carolina, North
Carolina, U.S.A.). A Further Contribution to the Theory of
Systematic Statistics.


Up to 1945 the main interest of statistical estimation has been in 
the “efficient estimator,” but from the point of view of practical use, 
it seems reasonable to inquire whether comparable results could have 
been obtained by a smaller expenditure. F. Mosteller [1946] proposed
the use of systematic statistics in this connection. The author [1952] 
developed a systematic theory of estimation and testing hypothesis with 
respect to the location and scale parameter of a population whose 
density depends on only these two parameters. 

There are many cases in which the samples are by their very nature 
ordered in magnitude, for example in a life test of electric lamps. In 
such cases the population probability distributions are usually supposed 
to be exponential. Thus, at least for the exponential distribution, 
estimation and testing of a hypothesis based upon systematic statistics 
are of great importance from the standpoint of practical application. 

There will be presented in this paper the table of the optimum 
spacings of the selected sample quantiles, corresponding best estimators, 
and a discussion on the testing procedure of a statistical hypothesis on 
the scale parameter of the exponential distribution $f(x) = (1/\sigma)e^{-x/\sigma}$
for $x > 0$.


441 DAVID ROSENBLATT (American University, Virginia, U.S.A.).
On the Stochastic Structure of Minkowski-Leontief Systems.


A linear system $x(I - A) = w$ is said to be of Minkowski-Leontief
type if $A$ is a finite non-negative square matrix of order $n$ with no row
sum exceeding unity and $x$, $w$ are non-negative row vectors. A non-null
solution $x$ of such a system is called admissible. Theorem: Every system
of Minkowski-Leontief type $x(I - A) = w$ which exhibits at least one
admissible solution is equivalent to a unique system $\bar{x}(I - \bar{A}) = \theta$, where
$\bar{A}$ is a stochastic matrix depending on $A$ and $w$ and $\theta$ is a null vector of
dimension at most $n + 1$. Every admissible solution of $x(I - A) = w$
(appropriately extended or contracted) is proportional to a convex linear
combination of the stationary stochastic vectors of $\bar{A}$. If $A$ is non-stochastic,
$w \ne \theta_n$, let $\bar{A}$ denote the matrix

$$\bar{A} = \begin{pmatrix} A & b \\ w^* & 0 \end{pmatrix}$$

where $b = (I - A)e'$, $w^* = \lambda^{-1} w$, $\lambda = we'$, and $e$ is the row vector with
all elements unity. If $(I - A)^{-1}$ exists and $w \ne \theta_n$, there exists a single
ergodic set of indices; if $w$ is positive the stationary vector of $\bar{A}$ is positive.
Clearly, $\bar{x} = (w(I - A)^{-1}, \lambda)$. If $w = \theta_n$ and $(I - A)$ is singular, $\bar{A}$ is taken as


a partitioned matrix involving $A_s$, where $A_s$ is the largest stochastic principal sub-matrix
of $A$. Systems of the present type occur in economic input-output
analysis and generally in socio-physical models based on
“balanced-margin” tables, i.e., non-negative square matrices $X$ such that $eX = eX'$.


DAVID ROSENBLATT (American University, Virginia, U.S.A.).

Consider a system $x(I - A) = w$ of Minkowski-Leontief type such
that $(I - A)^{-1}$ exists. Clearly, $(I - A)^{-1}$ exists if and only if $A$ contains
no stochastic principal sub-matrix. In a static economic input-output
context the element $a_{ij}$ is designated as the input (per unit output) to
industry or activity $i$ procured from industry $j$; $w_j$, $x_j$ are respectively
final output and total output (or activity level) of the jth industry.
Consider the uniquely corresponding system $\bar{x}(I - \bar{A}) = \theta$, where
$\bar{A}$ is stochastic. The unique stationary stochastic vector of $\bar{A}$ is given
by $\pi = (p_1, \ldots, p_{n+1})$. The “multiplier” $\sum_{j=1}^{n+1} x_j/\lambda$ is given
by $1/p_{n+1}$, where $\lambda = x_{n+1} = we'$. Given a non-singular matrix $(I - A)$,
the following relation holds in components of an admissible solution
for any $w$: $\sum_{j=1}^{n} (1 - r_j)x_j - \lambda = 0$, where $\lambda = we'$ and $r_j$ is
the jth row sum in $A$. The latter relation is the technical
production-possibility function of the economy in an input-output sense; $-\Delta x_i/\Delta x_j$
$= (1 - r_j)/(1 - r_i)$, $\Delta x_j/\Delta\lambda = 1/(1 - r_j)$, $i \ne j = 1, \ldots, n$,
are the invariant “substitution ratios” of the system, obviously
independent of $w$. Let $x$ be an admissible solution of $x(I - A) = w$, $(I - A)$
singular or not, and let $D(x, \lambda)$ be a diagonal matrix with components
of $x$ and $\lambda$ on the diagonal. Then $D(x, \lambda)\bar{A}$ is a “balanced-margin”
table. Consistent with a noted “substitution” result, $\sum_{j=1}^{n} K_j w_j =
x_{n+1} = \lambda$ where $K_j = 1$ for all $j$ independently of $w$.


Consider any system $x(I - A) = w$ of M-L type. The following
“aggregation” problem is of interest. Let an aggregation matrix $C$
be an $n \times r$ stochastic matrix of incidence type, $1 < r < n$. Let
$B = f(A)$ be a M-L matrix of order $r$. We consider conditions under
which $xAC = xCB$ obtains for admissible solutions $x$ of a system
$x(I - A) = w$. The following case is of special interest. Let a weight
matrix $E$ be an $n \times n$ diagonal matrix with non-negative entries on
the principal diagonal. A consolidation of a matrix $A$ of M-L type is an
$r \times r$ matrix $B = B(A; C, E) = (C'EC)^{-1}C'EAC$, $1 < r < n$. “Faithful
consolidation” of a stochastic system $x(I - A) = \theta$ is characterized
from the standpoint of ergodic structure; the condition $AC =
CB(A; C, E)$ is of particular interest. A general consolidation condition
for M-L systems is related to the “combining-of-classes” condition
of stochastic learning theory. The following is of economic interest:
the existence of $(I - B)^{-1}$ does not in general imply the existence of
$(I - A)^{-1}$, and conversely. In the static input-output model of II,
the ergodic structure of $\bar{A}$ of the equivalent system (and the role of
mean recurrence time $1/p_{n+1}$) suggest that the stationary stochastic
vector $\pi$ of $\bar{A}$ be computed iteratively using successive powers of $\bar{A}$,
yielding $x$, in lieu of matrix inversion with or without consolidation;
in most applications, $\lim_{k \to \infty} \bar{A}^k$ exists.
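
The iterative computation suggested in the last sentence can be sketched as follows. This is an illustration under explicit assumptions, not the author's procedure. The augmented stochastic matrix is taken to have the block form [[A, b], [w/λ, 0]] with b = (I − A)e' and λ = we'. The limit of its powers is assumed to exist. The matrix `A`, the vector `w` and all function names are made up for the example.

```python
def augmented_stochastic_matrix(A, w):
    """Build the assumed (n+1) x (n+1) stochastic matrix [[A, b], [w/lam, 0]]."""
    lam = sum(w)                                # lam = w e'
    b = [1.0 - sum(row) for row in A]           # b = (I - A) e'
    top = [list(row) + [b_i] for row, b_i in zip(A, b)]
    bottom = [w_j / lam for w_j in w] + [0.0]
    return top + [bottom], lam


def stationary_vector(P, iterations=500):
    """Approximate the stationary row vector by repeated multiplication pi <- pi P."""
    size = len(P)
    pi = [1.0 / size] * size
    for _ in range(iterations):
        pi = [sum(pi[i] * P[i][j] for i in range(size)) for j in range(size)]
    return pi


# Made-up two-industry system, for illustration only.
A = [[0.2, 0.3],
     [0.4, 0.1]]
w = [1.0, 2.0]

A_bar, lam = augmented_stochastic_matrix(A, w)
pi = stationary_vector(A_bar)
# Rescale so that the last component equals lam; the first n components
# then solve x (I - A) = w without inverting (I - A).
x = [lam * p / pi[-1] for p in pi[:-1]]
```

For this example the rescaled vector agrees with the direct solution $w(I - A)^{-1} = (17/6, 19/6)$, and $1/p_{n+1}$ equals the sum of the components of the extended solution divided by $\lambda$.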


444 S. N. ROY AND M. D. MOUSTAFA (University of North
Carolina, North Carolina, U.S.A.). Testing of Hypotheses on a
Mixture of Variates Some of Which Are Continuous and the
Rest Categorical.


We start from a $k + l$-variate distribution in which $k$ variates are
continuous and $l$ variates are categorical. The $k$ variates are assumed
to have a conditional multivariate normal distribution with respect to
the $l$ categorical variates which are assumed to have a multinomial
distribution. Appropriate hypotheses are framed in this situation,
analogous to the customary hypotheses on a single multivariate normal 
distribution (or to those in references (1) and (2) of the previous abstract), 
large sample tests of such hypotheses are developed and some of their 
properties studied. Next, instead of assuming a single multinomial 
distribution on the $l$ categorical variates, a product of multinomial
distributions is assumed and hypotheses are framed in this situation 
analogous to the customary ones for several multivariate normal 
distributions or to those in references (1) and (2), and large sample 
tests of such hypotheses and some of their properties are studied. 


445 S. N. ROY AND R. GNANADESHIKAN (University of North
Carolina, North Carolina, U.S.A.). Confidence Bounds
Associated with Multivariate Analysis of Variance.


We start from the same set-up as in the previous paper. The $S^*$
and $S$ (to be called respectively the dispersion matrix “due to the
hypothesis” and the dispersion matrix “due to the error”) are the exact
analogs of the variance “due to the hypothesis” and that “due to the
error” in the customary univariate analysis of variance. Given any
level $\alpha$, we can pick up a constant $c_\alpha$ from the tables mentioned in the
previous paper and make, with a probability greater than or equal to
1 — a, the confidence interval statement: ¢3/2(sS*) — < 
Un] < + [se.]' where U(s X s) is a non singular 
matrix given (in the paper) in terms of $A$ and $C$, and $c_{\max}^{1/2}[\eta' U \eta]$ is
zero if and only if $\eta = 0$, i.e., $H_0$ is true. With a joint probability
greater than or equal to $1 - \alpha$ we can also make simultaneous confidence
interval statements including the one given above and others exactly
similar to this but in terms of $S^{(i)}$, $S^{(i)*}$, $\eta^{(i)}$ (for $i = 1, 2, \ldots, p$)
and next in terms of $S^{(ij)}$, $S^{(ij)*}$, $\eta^{(ij)}$ (for $i \ne j = 1, 2, \ldots, p$) and
so on, where $S^{(i)}$ and $S^{(i)*}$ stand respectively for truncated matrices
after cutting out the $i$th row and $i$th column from $S$ and $S^*$, $\eta^{(i)}$ for
$\eta$ with the $i$th column cut out, $S^{(ij)}$, $S^{(ij)*}$ for $S$ and $S^*$ with the $i$th and
$j$th rows and columns cut out, $\eta^{(ij)}$ for $\eta$ with the $i$th and $j$th columns cut
out, and so on.


446 S. N. ROY (University of North Carolina, North Carolina,
U.S.A.). Multivariate Analysis of Variance.


Consider a model under which we have stochastic variates $X(p \times n)$
$= [x_1 \cdots x_n]$ such that the $x_i$ (for $i = 1, 2, \ldots, n$) are independent
$N[E(x_i), \Sigma]$, $E(X') = A(n \times m)\,\xi(m \times p)$, $A$ (to be called the
design matrix) is a matrix of constants given by the design of the
experiment, $\xi$ is a matrix of unknown parameters, rank $(A) = r \le m <
n$, $p \le n - r$ and $\Sigma$ is an unknown dispersion matrix. Under this
model suppose we have a testable hypothesis (the meaning and
mathematical criterion for testability being discussed in the paper)
$H_0: C(s \times m)\,\xi(m \times p)\,M(p \times q) = 0\,(s \times q)$ where $C$ and $M$ (to
be called the hypothesis matrices) are given such that rank $(C) =
s \le r$ and rank $(M) = q \le p$. The alternative is $H: C\xi M = \eta(s \times q)$
$(\ne 0)$. The test is that at a level $\alpha$ we accept $H_0$ if $c_{\max}(S^* S^{-1}) \le c_\alpha$
and reject $H_0$ otherwise, where $S^*$ and $S$ are matrices given (in the
paper) in terms of $X$, $A$, $C$ and $M$, $c_{\max}(T)$ denotes the largest root of a
matrix $T$ with real nonnegative roots, and $c_\alpha$ is a constant depending on $\alpha$,
min $(s, q)$ and $n - r$, which we can pick up from tables now under
construction and expected to be published shortly.


447 MORRIS SKIBINSKY (Michigan State University, Michigan).
A Limit Theorem and Bounds for an Optional Stopping
Probability.

Let $S_j$ be the standardized jth partial sum of a sequence of bounded
independent, identically distributed random variables, $K$ a positive
constant, and let

$$Q(m, n, K) = \Pr\{\max_{m \le j \le n} S_j > K\}.$$

It is shown by elementary methods that if $\lim_{m \to \infty} (n - m)/\sqrt{m} = 0$
then $\lim_{m \to \infty} Q(m, n, K) = 1 - \Phi(K)$, where $\Phi$ is the standard normal
c.d.f. Certain steps in the proof are then used to obtain simple bounds


for Q(m, n, K) when the sequence of random variables is generated 
from Bernoulli trials. 


448 EVAN J. WILLIAMS (North Carolina State College, North
Carolina, U.S.A.). On Statistics Independent of a Sufficient
Statistic.


It is shown that if, for a sample drawn from a population of values
of $x$ with distribution depending on a parameter $\theta$, the statistic $z$ is
sufficient for $\theta$, and $g$ is any statistic whose distribution is independent
of $\theta$, then $g$ and $z$ are independently distributed. The method of proof
is less sophisticated than that of Basu (Sankhya, 15: 377 [1955]). 

The result has application to the normal distribution: the mean of a
sample is distributed independently of any location-free statistic; and
to the gamma distribution: the mean of a sample is distributed
independently of any scale-free statistic. These well known results follow
since the sample mean is a sufficient statistic, in the former case for the
location parameter, in the latter case for the scale parameter.
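
The normal case can be illustrated by simulation. The sketch below is an illustration under stated assumptions, not part of the abstract: it draws normal samples with unit variance, takes the sample range as one example of a location-free statistic, and reports the correlation between sample mean and sample range. A correlation near zero is consistent with independence but does not prove it. The function names `pearson` and `mean_vs_range` are hypothetical.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def mean_vs_range(theta, sample_size=5, replications=5000, seed=0):
    """Correlation between the sample mean and the sample range.

    Samples are N(theta, 1). The range does not change when every
    observation is shifted by a constant, so it is location-free.
    """
    rng = random.Random(seed)
    means, ranges = [], []
    for _ in range(replications):
        sample = [rng.gauss(theta, 1.0) for _ in range(sample_size)]
        means.append(sum(sample) / sample_size)
        ranges.append(max(sample) - min(sample))
    return pearson(means, ranges)


if __name__ == "__main__":
    for theta in (0.0, 5.0):
        print("theta =", theta, "correlation =", mean_vs_range(theta))
```

The same experiment could be repeated for a gamma sample with a scale-free statistic, such as the ratio of the largest observation to the sample total.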

The limitations of the general result lie in the difficulty of deriving
statistics independent of parameters other than location and scale
parameters.

The connexion of the theorem with estimation theory is discussed.


449 MARVIN ZELEN (Statistical Engineering Laboratory,
Washington 25, D. C.). The Use of Incomplete Block Designs for
Asymmetrical Factorial Arrangements.


Let $A_s$ ($s = 1, 2, \dots, m$) denote the sth factor in an m-factor
factorial experiment such that $A_s$ has $m_s$ levels. Let = , ,
represent a particular experimental combination of the m-factors and 
let the mathematical model underlying the measurements be 


t=2 s=1 


where (a,);, represent the various main
effects and interactions, $b_j$ represents the block effect, and the $e_{ij}$ are
NID $(0, \sigma^2)$. Algorithms are given for using the balanced incomplete
and the group divisible designs for asymmetrical factorial arrangements.
Let $M(s)$ be the square matrix (of dimension $m_s$) $M(s) = m_s I - J$
where J is a matrix having all elements unity, and define the
direct product of p such matrices by
$M(1, 2, \dots, p) = [M(1) \times M(2) \times \cdots \times M(p)]$ $(p < m)$.
Then the variance-covariance matrix
of a p-factor interaction for the G.D. case can be written as
t = 1 or 2. For the BIBD, the same expression
holds with $E_1 = E_2$. The correlations between the different
interactions are all zero and since
$M^2(1, 2, \dots, p) = M(1, 2, \dots, p) \prod m_s$, the corresponding
quadratic form follows a $\chi^2$ distribution with $\prod (m_s - 1)$
degrees of freedom under the hypothesis of no p-factor interaction
effects.
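
The matrix algebra behind the degrees of freedom can be checked directly. The sketch below is an illustration, not the author's algorithm: it builds M(s) = m_s I - J for each factor, forms the direct (Kronecker) product, and verifies that the product matrix squared equals the product matrix multiplied by the product of the m_s. Because the matrix divided by that product is then idempotent, its rank equals its trace divided by the same product. The function names are hypothetical, and the level counts 2, 3, 4 are an arbitrary example.

```python
from functools import reduce


def m_matrix(levels):
    """Return M(s) = m_s * I - J for a factor with `levels` levels."""
    return [[levels - 1 if i == j else -1 for j in range(levels)]
            for i in range(levels)]


def kron(a, b):
    """Direct (Kronecker) product of two matrices given as lists of rows."""
    return [[x * y for x in row_a for y in row_b]
            for row_a in a for row_b in b]


def matmul(a, b):
    """Ordinary matrix product of two matrices given as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols]
            for row in a]


def interaction_matrix(level_counts):
    """Direct product M(1) x M(2) x ... x M(p) for the given level counts."""
    return reduce(kron, [m_matrix(m) for m in level_counts])


if __name__ == "__main__":
    levels = [2, 3, 4]  # example level counts m_1, m_2, m_3
    scale = 1
    for m in levels:
        scale *= m
    big = interaction_matrix(levels)
    squared = matmul(big, big)
    scaled = [[scale * v for v in row] for row in big]
    trace = sum(big[i][i] for i in range(len(big)))
    print("M squared equals (prod m_s) * M:", squared == scaled)
    print("rank = trace / prod m_s =", trace // scale)
```

For the example levels the reported rank is (2 - 1)(3 - 1)(4 - 1), in agreement with the degrees of freedom of a three-factor interaction.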


INTERNATIONAL
International Statistical Institute


The 30th meeting of the International Statistical Institute was held
in Stockholm from August 8-14, 1957. Three of the sessions were
organized by the Biometric Society, and the following papers were
presented: Problems of Experimentation (organizer, B. Matérn): G. E. P.
Box, Elucidation of basic mechanisms; O. L. Davies, Screening tests
in the pharmaceutical industry; D. J. Finney, Statistical problems of
plant selection; S. H. Justesen and M. Keuls, Note on nonorthogonal
designs. Statistical Genetics (organizer, L. L. Cavalli-Sforza): J. A. Böök,
Practical applications of the theory of population genetics to man;
R. A. Fisher, Polymorphism and natural selection. Statistics in Medical
Research (organizer, L. Martin): J. Cornfield, J. Steinfeld and S.
Greenhouse, Models for medical experiments using isotopically labelled
tracers; B. G. Greenberg and A. E. Sarhan, Some applications of order
statistics; J. O. Irwin, Experiments on the carcinogenic action of mineral
oils; A. W. Kimball, Disease incidence estimation in populations
subject to multiple causes of death; D. Schwartz and G. Anguera, Une
cause de biais dans certaines enquêtes médicales: le temps de séjour à
l'hôpital; W. Billewicz, Practical problems in a sequential medical trial.


REGIONAL 
Belgian Region


The Société Adolphe Quetelet, chaired by Mr. R. Laurent, had the
pleasure of welcoming as its speaker, on March 27 last, Mr.
Raymond van den Driessche, Ingénieur des Eaux et Forêts A. I. Gx.
and Chef de Travaux of the Biometrics Division of I.N.E.A.C.
(Yangambi, Belgian Congo). The lecture, which took place at the Fondation
Universitaire, dealt with the following subject: “La place dans
l'expérimentation agricole des plans en blocs incomplets partiellement
équilibrés” (The place of partially balanced incomplete block designs
in agricultural experimentation).


British Region 


A meeting held on May 30th, 1957 was devoted to problems of
human blood groups, the following papers being presented: J. A.
Fraser Roberts, Blood Groups and Susceptibility to Disease; J. H.
Renwick and Sylvia Lawler, Mapping Human Chromosomes; A. E.
Mourant, The Nature and Possible Interpretation of Blood Group
Population Data.

The summer meeting of the Region was held on July 11th, 1957,
when a visit was paid to the Central Veterinary Laboratory, Weybridge.


French Region


At a meeting held on June 19, Messrs. J. Sutter and I.-N. Toan
presented a paper entitled “Problèmes posés par l'étude de
la dispersion des gènes dans les populations humaines occidentales”
(Problems raised by the study of the dispersion of genes in Western
human populations).


Switzerland 


The Biometric Seminar for Agronomists held in Zurich (July 22-26)
at the Swiss Federal Institute of Technology was attended by 31
participants. Introductory lectures on statistical methods were given by
Professor Linder (Genf), Professor Lörtscher, Mr. Abt, Dr. LeRoy
(Zurich), and Dr. Auer (Zuoz).

The seminar was organised by the Swiss Branch of the Biometric
Society. Assistance was received from the Swiss Society of Agronomy
Engineers.

At the Annual Meeting of the Swiss Branch, held on July 27 in
Zurich, the following papers were read: A. Linder (Genf), Die
Bedeutung und Berechnung der Streuungskomponenten; Henri L. LeRoy
(Zurich), Populations-genetische Betrachtungen zur Erbwertschätzung.


WNAR 


The annual meeting was held at Stanford University, August 27
and 28, jointly with the American Institute of Biological Sciences and
the AAAS (Pacific Division). The invited address, given by Dr. W. L.
Smith of Cambridge, was entitled “A random walk problem arising
from the study of the flour beetle.”

The program included four sessions: Problems in Discriminatory
Analysis (presided over by S. W. Nash): papers by Rosedith Sitgreaves,
A. C. Walker and W. F. Royce; Epidemiology (presided over by W. J.
Dixon): papers by M. F. Laughlin, H. B. Messinger, M. S. Ahmed and
W. I. Taylor; Contributed Papers (presided over by D. G. Chapman):
papers by J. Gurland and D. G. Chapman; Genetics (presided over by
F. T. Schultz): papers by B. O. Berg, W. Becker, H. Abplanalp, R. W.
Allard, E. Novitski, Dorothy Lowry and A. C. Walker.


MEMBERSHIP 


The paid-up membership of the Society at the end of 1956 was 1303, 
divided between regions as follows: 




Region          Members     Region          Members
Australasian         65     India                14
Belgian              60     Italian              50
Brazilian            68     Japan                47
British             162     Netherlands          33
Denmark              14     Sweden               14
ENAR                473     Switzerland          22
French               67     WNAR                 96
German               69     At large             49


MEMBERS


The following notifications of changes of address and of location of new 
members were received during May-July, 1957. 


New Addresses 


Mr. John C. Bain, 32 South Munn, East Orange, N. J., U.S.A. 
Dr. Samuel H. Brooks, 1260 Decisadero, Pacific Grove, California, 
U.S.A. 
Mr. Victor Chew, Department of Experimental Statistics, North 
Carolina State College, Box 5457, Raleigh, North Carolina, U.S.A. 
Dr. Ralph E. Comstock, Department of Animal Husbandry, Institute of 
Agriculture, University of Minnesota, St. Paul 1, Minnesota, U.S.A. 

Dr. William S. Connor, Statistical Engineering Lab., National Bureau
of Standards, Washington 25, D. C., U.S.A. 

Mr. Ellsworth B. Cook, Naval Med. Field Res. Lab., Camp Lejeune, 
N. C., U.S.A. 

Dr. L. Otis Emik, Air Pollution Med. Prog., Public Health Service, 
DHEW, Washington 25, D. C., U.S.A. 

Dr. Gerald Friedman, 65 Park Terrace East, Apartment C-66, Man- 
hattan, New York 34, N. Y., U.S.A. 

Mr. Roger Heimlich, 222 Saf., Columbus, Indiana, U.S.A. 

Dr. Henry Klaunberg, National Press Building, Washington 4, D. C., 
U.S.A. 

Dr. Eschscholtzia L. Lucia, c/o Mrs. H. B. Stevenson II, 1358 Weller
Way, Sacramento 18, California, U.S.A. 

Judson McGuire Jr., Apartado 654, Camagüey, Cuba

Dr. A. M. Mood, General Analysis Corp., 11753 Wilshire Blvd., West 
Los Angeles, California, U.S.A. 

Mr. Jack Moshman, Council for Economic and Industry Research, Inc., 
1200 Jefferson Davis Highway, Arlington 2, Virginia, U.S.A. 

Mr. Sydney I. Neuwirth, 71-08 Parsons Boulevard, Flushing, New York,
U.S.A.

Dr. Bernard Ostle, c/o W. E. Boyes, Dept. 1290, Sandia Corporation,
Sandia Base, Albuquerque, New Mexico, U.S.A.

Dr. Earl R. Rich, Dept. of Zoology, University of Miami, Coral Gables 
46, Florida, U.S.A. 

Miss Jean Roberts, 8103 Kentburg Drive, Bethesda 14, Maryland, 
U.S.A. 

Dr. David W. Robertson, Dept. of Agronomy, Colorado State University, 
Fort Collins, Colorado, U.S.A. 

Mr. O. K. Sagen, 133 Carroll Street, SE, Washington 3, D. C., U.S.A. 

Morton D. Schweitzer, Ph.D., 600 West 168 St., New York 32, N. Y., 
U.S.A.

M. C. K. Tweedie, Department of Mathematics, The University, 
Manchester 13, Great Britain. 

Prof. David F. Votaw, Jr., Sheffield Hall, Yale University, New
Haven, Connecticut, U.S.A. 

Dr. John E. Walsh, 5019 Donna Avenue, Tarzana, California, U.S.A.


New Members 
Belgian 


H. A. L. J. Amand, I.N.E.A.C., Yangambi, Belgian Congo 

P. Dineur, I.N.E.A.C., Gandajika, Kasai, Belgian Congo 

Dr. O. Lazer, 75a, Bd. de Cabbeck, Tirlemont, Belgium 

M. J. W. Luttgens, Yangambi, B.P. 37, Belgian Congo 

T. J. Marymen, I.N.E.A.C., Gandajika, Kasai, Belgian Congo 


British 


Dr. Allen Birnbaum, Mathematical Department, Imperial College, 
Exhibition Road, London, S.W. 7, England

Mr. R. M. Cormack, Department of Statistics, Marischal College, 
Aberdeen, Scotland 

Mr. R. C. Elston, 161 Queen’s Drive, London, N. 4, England 

Dr. Arthur L. Jolly, Imperial College of Tropical Agriculture, St.
Augustine, Trinidad, B.W.I. 

Mr. C. D. Kemp, The Grassland Research Station, Hurley, Near 
Maidenhead, Berks, England 

Mr. W. L. B. Nixon, Genetics Unit, Institute of Psychiatry, Denmark
Hill, London, S.E. 5, England


ENAR


Mr. Ross Willard Adams, 401 Natural History Building, University of 
Illinois, (Dept. of Physiology), Urbana, Illinois, U.S.A. 

Dr. Chester Alexander, Westminster College, Fulton, Missouri, U.S.A. 

Mr. Earl L. Atwood, 7117 Woodland Avenue, Takoma Park 12, Mary- 
land, U.S.A. 

Dr. Donald W. Bailey, R. B. Jackson Memorial Laboratory, Bar 
Harbor, Maine, U.S.A. 

Dr. Harold J. Bornhold, 27 East Eleventh Street, New York 3, N.Y., 
U.S.A. 

Dr. Hugh Bishop Cannon, Horticulture Division, Central Experimental 
Farm, Ottawa, Ontario, Canada 

Dr. C. K. Chai, Jackson Laboratory, Bar Harbor, Maine, U.S.A. 

Dr. Ernest L. Corley, Dept. of Dairy Husbandry, University of Wis- 
consin, Madison, Wisconsin, U.S.A. 

Mr. Richard J. Daum, Department of Entomology, Cornell University, 
Ithaca, New York, U.S.A. 

Mr. Howard H. Engelbrecht, U. S. Weather Bureau, Friendship
International Airport, Baltimore, Maryland, U.S.A.

Mr. Peter W. Frank, Department of Zoology, University of Missouri, 
Columbia, Missouri, U.S.A. 

Mr. John J. Gart, Department of Statistics, Virginia Polytechnic 
Institute, Blacksburg, Virginia, U.S.A. 

Prof. Dr. James A. Hagans, Experimental Therapeutic Unit, Dept. of
Medicine, University of Oklahoma School of Medicine, Oklahoma 
City, Oklahoma, U.S.A. 

Mr. Jimmy H. K. Kan, Department of Poultry Science, Texas A and 
M College, College Station, Texas, U.S.A. 

Miss Katherine Baker Laughton, 174 Pearl Street, Burlington, Vermont, 
U.S.A. 

Harry H. Shorey, Dept. of Entomology, Cornell University, Ithaca, 
N.Y., U.S.A. 

Dr. J. N. Spuhler, Institute of Human Biology, University of Michigan, 
Ann Arbor, Michigan, U.S.A. 

Dr. N. T. Werthessen, S.W. Foundation for Res. and Education, 
San Antonio, Texas, U.S.A. 

Miss Joan Wax, Research Laboratories, Parke, Davis and Co., Detroit 
32, Michigan, U.S.A. 


WNAR 


Robert W. Allard, 223 Hunt Hall, University of California, Davis, 
California, U.S.A. 


German


Hilmar Grimm, Jena/Thur DDR, Prof.-Ibrahim-Str. 16, Germany

Vladimir Maly, Karlovo nam 32, Prague, CSR 

Prof. Dr. Heinz Rethmann, Kiel/Holstein, Esmarchstr. 42, Germany 

Erwin Roth, (13a) Markbreit/Main, Enheimer Strasse 481, Germany 

Prof. Dr. Wilhelm Rudorf, Max-Planck Institut für Züchtungsforschung,
Voldagsen über Elze/Hann, Britische Zone, Germany

Dr. August Schaaf, Schillerstr. 9, Berlin-Eichwalde (DDR), Germany 

Karl Schrimpf, Institut für Pflanzenbau und Pflanzenzüchtung,
Stuttgart, Hohenheim, Germany


Netherlands


D. K. de Jongh, Jacob van Ruysdaellaan 25, Heemstede, the Netherlands


Indian 


Mr. K. C. Agrawal, Senior Statistical Assistant, Government Horti- 
cultural Research Station, Saharanpur, India 

Mr. Y. N. Bali, Statistical Officer, Cane Commissioners Office, U.P.,
Lucknow, India 

Mr. Rughuvir Giri, Statistician, Directorate of Land Records, Madhya 
Pradesh, Gwalior, India 

Mr. G. P. Kapoor, Deputy Cane Commissioner, U.P. Lucknow, India 

Dr. P. S. Lamba, Principal, Rafi Ahmed Kidwai Agricultural Institute,
Sehore (Bhopal), M.P., India 

Dr. B. Mukerji, Director, Central Drug Research Institute, Chattar 
Manzil Palace, Lucknow, India 

Mr. D. P. Singh, Director, Planning Research and Action Inst., U.P.,
Kalakankar House, Lucknow, India 


Italian 


Dr. Ruggero P. N. Ceppellini, Instituto Sieroterapico Milanese, Via 
Darwin 22, Milano, Italy 

Dr. J. Gani, Department of Mathematics, University of Western 
Australia, Nedlands, W. A., Australia 

Dr. Giulio Alfredo Maccacaro, Instituto di Igiene, Via Forlanini 1, 
Pavia, Italy 

Dr. U. Prota, Instituto di Patologia Vegetale, Piazza Conte Moriana 8,
Sassari, Italy 

Dr. Marcello Siniscalco, Inst. Genetics, Univ. of Naples, Mezzocannone 8,
Naples, Italy


Japanese


Prof. Masahiro Aya, Agricultural Chem. Insp. Station, 844, Suzuki-shinden,
Kodaira-machi, Kitatama-gun, Tokyo, Japan


Professor Shiro Chikuni, Naikai Reg. Fisheries Res. Lab., c/o Fisheries
Lab. 1328, Ujina-machi, Hiroshima, Japan 

Mr. Akio Kudo, Faculty of Science, Kyushu University, Hakozaki,
Fukuoka, Japan 


NEWS AND ANNOUNCEMENTS


Members are invited to transmit to their National or Regional Secre- 
tary (if members at large to the General Secretary) news of appointments, 
distinctions, or retirements and announcements of professional interest. 


Dr. Bernard Ostle, formerly Professor of Mathematics and Director
of the Statistical Laboratory at Montana State College, is now with 
the Reliability Department of Sandia Corporation, Albuquerque, New 
Mexico. His mailing address is 3120 Cardenas Drive N.E., Albuquerque, 
New Mexico. 


EDUCATIONAL TESTING SERVICE FELLOWSHIPS 


The Educational Testing Service is offering for 1958-59 its eleventh 
series of research fellowships in psychometrics leading to the Ph.D. 
degree at Princeton University. Open to men who are acceptable to 
the Graduate School of the University, the two fellowships each carry 
a stipend of $2,650 a year and are normally renewable. Fellows will be 
engaged in part-time research in the general area of psychological 
measurement at the offices of the Educational Testing Service and will, 
in addition, carry a normal program of studies in the Graduate School. 

The closing date for completing applications is January 3, 1958. 
Information and application blanks may be obtained from: Director of 
Psychometric Fellowship Program, Educational Testing Service, 20 
Nassau Street, Princeton, New Jersey, U.S.A. 